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To Paul Erdos, 
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a friend and collaborator for 25 years, and a 
master of elementary methods in number theory. 


Preface 


Arithmetic is where numbers run across your mind looking for 
the answer. 

Arithmetic is like numbers spinning in your head faster and 
faster until you blow up with the answer. 

KABOOM!!! 

Then you sit back down and begin the next problem. 


Alexander Nathanson [99] 


This book, Elementary Methods in Number Theory, is divided into three 
parts. 

Part I, “A first course in number theory,” is a basic introduction to el- 
ementary number theory for undergraduate and graduate students with 
no previous knowledge of the subject. The only prerequisites are a little 
calculus and algebra, and the imagination and perseverance to follow a 
mathematical argument. The main topics are divisibility and congruences. 
We prove Gauss’s law of quadratic reciprocity, and we determine the moduli 
for which primitive roots exist. There is an introduction to Fourier anal- 
ysis on finite abelian groups, with applications to Gauss sums. A chapter 
is devoted to the abc conjecture, a simply stated but profound assertion 
about the relationship between the additive and multiplicative properties 
of integers that is a major unsolved problem in number theory. 

The “first course” contains all of the results in number theory that are 
needed to understand the author’s graduate texts, Additive Number Theory: 
The Classical Bases [104] and Additive Number Theory: Inverse Problems 
and the Geometry of Sumsets [103]. 
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The second and third parts of this book are more difficult than the “first 
course,” and require an undergraduate course in advanced calculus or real 
analysis. 

Part II is concerned with prime numbers, divisors, and other topics in 
multiplicative number theory. After deriving properties of the basic arith- 
metic functions, we obtain important results about divisor functions, and 
we prove the classical theorems of Chebyshev and Mertens on the distribu- 
tion of prime numbers. Finally, we give elementary proofs of two of the most 
famous results in mathematics, the prime number theorem, which states 
that the number of primes up to x is asymptotically equal to x/log x, and 
Dirichlet’s theorem on the infinitude of primes in arithmetic progressions. 

Part III, “Three problems in additive number theory,” is an introduction 
to some classical problems about the additive structure of the integers. The 
first additive problem is Waring’s problem, the statement that, for every 
integer k > 2, every nonnegative integer can be represented as the sum 
of a bounded number of kth powers. More generally, let f(a) = aga* + 
ap—12*-!+---+ a9 be an integer-valued polynomial with a, > 0 such that 
the integers in the set A(f) = {f(a) : = 0,1,2,...} have no common 
divisor greater than one. Waring’s problem for polynomials states that 
every sufficiently large integer can be represented as the sum of a bounded 
number of elements of A(f). 

The second additive problem is sums of squares. For every s > 1 we 
denote by R,(n) the number of representations of the integer n as a sum 
of s squares, that is, the number of solutions of the equation 

n= zy Seer x 
in integers 21,...,2,. The shape of the function R,(n) depends on the 
parity of s. In this book we derive formulae for R,(n) for certain even 
values of s, in particular, for s = 2,4,6,8, and 10. 

The third additive problem is the asymptotics of partition functions. 
A partition of a positive integer n is a representation of n in the form 
nm = a, +-+:+ ax, where the parts aj,...,@% are positive integers and 
a, > +++ > ag. The partition function p(n) counts the number of partitions 
of n. More generally, if A is any nonempty set of positive integers, the 
partition function pa(n) counts the number of partitions of n with parts 
belonging to the set A. We shall determine the asymptotic growth of p(n) 
and, more generally, of pa(n) for any set A of integers of positive density. 

This book contains many examples and exercises. By design, some of 
the exercises require old-fashioned manipulations and computations with 
pencil and paper. A few exercises require a calculator. Number theory, after 
all, begins with the positive integers, and students should get to know and 
love them. 

This book is also an introduction to the subject of “elementary methods 
in analytic number theory.” The theorems in this book are simple state- 
ments about integers, but the standard proofs require contour integration, 
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modular functions, estimates of exponential sums, and other tools of com- 
plex analysis. This is not unfair. In mathematics, when we want to prove a 
theorem, we may use any method. The rule is “no holds barred.” It is OK 
to use complex variables, algebraic geometry, cohomology theory, and the 
kitchen sink to obtain a proof. But once a theorem is proved, once we know 
that it is true, particularly if it is a simply stated and easily understood 
fact about the natural numbers, then we may want to find another proof, 
one that uses only “elementary arguments” from number theory. Elemen- 
tary proofs are not better than other proofs, nor are they necessarily easy. 
Indeed, they are often technically difficult, but they do satisfy the aesthetic 
boundary condition that they use only arithmetic arguments. 

This book contains elementary proofs of some deep results in number 
theory. We give the Erdés-Selberg proof of the prime number theorem, 
Linnik’s solution of Waring’s problem, Liouville’s still mysterious method 
to obtain explicit formulae for the number of representations of an integer 
as the sum of an even number of squares, and Erd6s’s method to obtain 
asymptotic estimates for partition functions. Some of these proofs have not 
previously appeared in a text. Indeed, many results in this book are new. 

Number theory is an ancient subject, but we still cannot answer the 
simplest and most natural questions about the integers. Important, easily 
stated, but still unsolved problems appear throughout the book. You should 
think about them and try to solve them. 


Melvyn B. Nathanson! 
Maplewood, New Jersey 
November 1, 1999 


1Supported in part by grants from the PSC-CUNY Research Award Program and the 
NSA Mathematical Sciences Program. This book was completed while I was visiting the 
Institute for Advanced Study in Princeton, and I thank the Institute for its hospitality. 
I also thank Jacob Sturm for many helpful discussions about parts of this book. 


Notation and Conventions 


We denote the set of positive integers (also called the natural numbers) by 
N and the set of nonnegative integers by No. The integer, rational, real, 
and complex numbers are denoted by Z, Q, R, and C, respectively. The 
absolute value of z € C is |z|. We denote by Z” the group of lattice points 
in the n-dimensional Euclidean space R”. 

The integer part of the real number x, denoted by [2], is the largest 
integer that is less than or equal to x. The fractional part of x is denoted 
by {x}. Then x = [a] + {x}, where [a] € Z, {a} © R, and 0 < {x} < 1.In 
computer science, the integer part of x is often called the floor of x, and 
denoted by |a|. The smallest integer that is greater than or equal to x is 
called the ceiling of x and denoted by [2]. 

We adopt the standard convention that an empty sum of numbers is 
equal to 0 and an empty product is equal to 1. Similarly, an empty union 
of subsets of a set X is equal to the empty set, and an empty intersection 
is equal to X. 

We denote the cardinality of the set X by |X|. The largest element in a 
finite set of numbers is denoted by max(X) and the smallest is denoted by 
min(X). 

Let a and d be integers. We write dla if d divides a, that is, if there exists 
an integer g such that a = dq. The integers a and b are called congruent 
modulo m, denoted by a=b (mod m), if m divides a — b. 

A prime number is an integer p > 1 whose only divisors are 1 and p. 
The set of prime numbers is denoted by P, and px is the kth prime. Thus, 
pi = 2,p2 = 3,...,pi1 = 31,.... Let p be a prime number. We write p"||n 
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if p” is the largest power of p that divides the integer n, that is, p” divides 
n but p+! does not divide n. 

The greatest common divisor and the least common multiple of the inte- 
gers a1,...,@, are denoted by (a1,...,a,) and [a,,...,a,], respectively. If 
A is a nonempty set of integers, then gcd(A) denotes the greatest common 
divisor of the elements of A. 

The principle of mathematical induction states that if S(k) is some state- 
ment about integers k > ko such that S'(ko) is true and such that the truth 
of S(k—1) implies the truth of S(k), then S(k) holds for all integers k > ko. 
This is equivalent to the minimum principle: A nonempty set of integers 
bounded below contains a smallest element. 

Let f be a complex-valued function with domain D, and let g be a 
function on D such that g(x) > 0 for all x € D. We write f < g or 
f = O(g) if there exists a constant c > 0 such that |f(x)| < cg(x) for 
all « € D. Similarly, we write f >> g if there exists a constant c > 0 
such that |f(x)| > cg(a) for all  € D. For example, f >> 1 means that 
f(z) is uniformly bounded away from 0, that is, there exists a constant 
c > 0 such that |f(x)| > c for all « € D. We write f <x ,... g if there 
exists a positive constant c that depends on the variables k, ¢,... such that 
_.. g Similarly. The functions 
f and g are called asymptotic as x approaches a if limz—.a f(x)/g(x) = 1. 
Positive-valued functions f and g with domain D have the same order of 
magnitude if f<g< f, or equivalently, if there exist positive constants c1 
and cp such that c, < f(x)/g(a) < ce for all x € D. The counting function 
of a set A of integers counts the number of positive integers in A that do 
not exceed x, that is, 

A(z) = S> 1. 


acA 

l<a<a 
Using the counting function, we can associate various densities to the set 
A. The Shnirel’man density of A is 


The lower asymptotic density of A is 


dh Siete 


n—-0o n 


The upper asymptotic density of A is 


dy (A) = limsup i) 
If dy(A) = dy (A), then d(A) = dz(A) is called the asymptotic density of 
A, and 
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Let A and B be nonempty sets of integers and d € Z. We define 


the sumset 
A+B={at+b:aceA,beE Bh, 


the difference set 
A-B={a-b:aceA,beE Bh, 
the product set 
AB = {ab: ae A,beE Bh, 


and the dilation 
dx A= {d}A={da:ae A}. 


The sets A and B eventually coincide, denoted by A ~ B, if there exists 
an integer m9 such that n € A if and only if n € B for all n > no. 
We use the following arithmetic functions: 


Up(n) the exponent of the highest power of p that divides n 
p(n) Euler phi function 

p(n) Mobius function 

d(n) the number of divisors of n 

a(n) the sum of the divisors of n 

1 (ax) the number of primes not exceeding x 
O(x), w(x) Chebyshev’s functions 

L(n) log n if n is prime and 0 otherwise 

w(n) the number of distinct prime divisors of n 
Q(n) the total number of prime divisors of n 
L(n) log n, the natural logarithm of n 

A(n) von Mangoldt function 

Ao(n) generalized von Mangoldt function 

1(n) 1 for all n 

6(n) lifn=landO0ifn>2 


A ring is always a ring with identity. We denote by R* the multiplicative 
group of units of R. A commutative ring R is a field if and only if R* = 
R\ {0}. If f(t) is a polynomial with coefficients in the ring R, then No(f) 
denotes the number of distinct zeros of f(t) in R. We denote by M,,(R) the 
ring of n x n matrices with coefficients in R. 

In the study of Liouville’s method, we use the symbol 


0 if n is not a square, 


{f()in=e = { fO ittn=L, £0. 
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Part I 


A First Course in Number 
Theory 


1 
Divisibility and Primes 


1.1 Division Algorithm 


Divisibility is a fundamental concept in number theory. Let a and d be 
integers. We say that d is a divisor of a, and that a is a multiple of d, if 
there exists an integer q such that 


a= dq. 


If d divides a, we write 

dla. 
For example, 1001 is divisible by 7 and 13. Divisibility is transitive: If a 
divides b and 6 divides c, then a divides c (Exercise 14). 

The minimum principle states that every nonempty set of integers bounded 
below contains a smallest element. For example, a nonempty set of nonneg- 
ative integers must contain a smallest element. We can see the necessity of 
the condition that the nonempty set be bounded below by considering the 
example of the set Z of all integers, positive, negative, and zero. 

The minimum principle is all we need to prove the following important 
result. 


Theorem 1.1 (Division algorithm) Let a and d be integers with d > 1. 
There exist unique integers q and r such that 


a=dq+r (1.1) 


and 


O0<r<d-1. (1.2) 
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The integer q is called the quotient and the integer r is called the re- 
mainder in the division of a by d. 
Proof. Consider the set S of nonnegative integers of the form 


a—dx 


with « € Z. Ifa >0, thena=a-—d-0€ S. Ifa <0, let x = —y, where 
y is a positive integer. Since d is positive, we have a—dx =a+dyeS 
if y is sufficiently large. Therefore, S is a nonempty set of nonnegative 
integers. By the minimum principle, S$ contains a smallest element r, and 
r=a-—dq> 0 for some q € Z. If r > d, then 


0<r—d=a-dq+l1)<r 


and r —d € S, which contradicts the minimality of r. Therefore, g and r 
satisfy conditions (1.1) and (1.2). 
Let q1,71, 2,72 be integers such that 


a=dq+7r=dq+r2 and O<ryre<d-1. 
Then 
lry — ral <d-l1 
and 
dn — qa) =7T2—-T11. 


If q # qo, then 
la — | >1 
and 
d<dlq — | =|r2-ri| <d—-1, 


which is impossible. Therefore, qj = q2 and r; = rg. This proves that the 
quotient and remainder are unique. 


For example, division of 16 by 7 gives the quotient 2 and the remainder 
2, that is, 
16=7-2+2. 


Division of —16 by 7 gives the quotient —3 and the remainder 5, that is, 
—16 = 7(-3) +5. 


A simple geometric way to picture the division algorithm is to imagine 
the real number line with dots at the positive integers. Let gq be a positive 
integer, and put a large dot on each multiple of g. The integer a either 
lies on one of these large dots, in which case a is a multiple of q, or a lies 
on a dot strictly between two large dots, that is, between two successive 
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multiples of g, and the distance r between a and the largest multiple of q 
that is less than a is a positive integer no greater than q—1. For example, 
if qg=7 and a= +16, we have the following picture. 


-16 16 


-21 -14 -7 0 7 14 21 


The principle of mathematical induction states that if S(k) is some state- 
ment about integers k > ko such that S'(ko) is true and such that the truth 
of S(k—1) implies the truth of S(k), then S(k) holds for all integers k > ko. 
Another form of the principle of mathematical induction states that if S(ko) 
is true and if the truth of S(ko), S(ko + 1),...,S(& — 1) implies the truth 
of S(k), then S(k) holds for all integers k > ko. Mathematical induction is 
equivalent to the minimum principle (Exercise 18). 

Using mathematical induction and the division algorithm, we can prove 
the existence and uniqueness of m-adic representations of integers. 


Theorem 1.2 Let m be an integer, m > 2. Every positive integer n can 
be represented uniquely in the form 


n=aotaym+tagm?+---+aym*, (1.3) 


where k is the nonnegative integer such that 


m* <n<m*t 


and ag, @1,...,@,% are integers such that 
l<a,<m-1 


and 
0O<a<m-1 fori =0,1,2,...,k—1. 


This is called the m-adic representation of n. The integers a; are called 
the digits of n to base m. Equivalently, we can write 


lo) 

i 

n= y aym, 
i=0 


where 0 < a; < m—1 for all 2, and a; = 0 for all sufficiently large integers 
i. 

Proof. For k > 0, let S(k) be the statement that every integer in the 
interval m* < n < m*+! has a unique m-adic representation. We use 
induction on k. The statement S(0) is true because if 1 < n < m, then 
n = do is the unique m-adic representation. 
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Let k > 1, and assume that the statements S(0),S(1),...,S(k —1) are 
true. We shall prove $(k). Let m* <n < m*+!. By the division algorithm, 
we can divide n by m* and obtain 


n= aym* +r, where 0 <r <mé*. 


Then 


k “EE 


0<m*—r<n—r=agm <n<mi*t. 


Dividing this inequality by m*, we obtain 0 < ay < _m. Since m and ax are 
integers, it follows that 
l<ap<m-l1. 

If r = 0, then n = aym* is an m-adic representation. If r > 1, then 
m* <r <m*'*+! for some nonnegative integer k’ < k—1. By the induction 
assumption, Sk’) is true and r has a unique m-adic representation of the 
form 

r=agtaym+er:+ deine 
with 0 <a; < m—1 fori =0,1,...,k—1. It follows that n has the m-adic 
representation 


Nn=agtaym+-+-+ a,—ym*—1 + a,zm*. 


We shall show that this representation is unique. Let 


n=bo tbym+---+ bem 


be another m-adic representation of n, where 0 < b; < m-— 1 for all 
j=0,1,...,2and be > 1. If €>k+4+1, then 


n<m*t1 < bym! < n, 


which is impossible. If @< k — 1, then the inequalities b; < m— 1 imply 
that 


n = bo tbyn+---+bem 
< (m—1)+(m—1)m4+---+(m-—1)m! 
— moti_y 
< m* 
<n 


which is also impossible. Therefore, k = @. If a, < by, then 


n= ap ane bapa + apm? 
< (m—-1)+(m—-1)m4+---+(m—1)m*! + agm* 
= (m*—1)+agm* 
< (ag +1)m* 
< bym* 
<n 


Fs 


1.1 Division Algorithm 7 


which again is impossible. Therefore, b, < ax. By symmetry, we have ay, < 
by and so ax = by. Then 


n—a,ym* = agtayme4 agm? free me 
2 k-1 
bo + bym + bom +++++bp,_-1m 
< m*, 


By the induction assumption, a; = b; for i = 0,1,...,4 — 1. Thus, the 
m-adic representation of n exists and is unique, and S(k) is true. By math- 
ematical induction, S(k) holds for all k > 0. 


For example, the 2-adic representation of 100 is 
100 = 1-2? + 1-2°+1- 29, 
and the 3-adic representation of 100 is 
100 =1+2-37+1-3%. 
The 10-adic representation of 217 is 


217=7+1-10'+2-107. 


Exercises 
1. Find all divisors of 20. 


2. Find all divisors of 29,601. 
3. Find all divisors of 1. 


4. Find the quotient and remainder for a divided by d when 


(a) a = 281 and d= 23. 
(b) a = 281 and d= 12. 
(c) a = 291 and d= 23. 


(d) a= 291 and d= 12. 


5. Find the quotient and remainder for 10" + 1 divided by 11 for k = 
1,2,3,4, 5. 


6. Compute the m-adic representation of 526 for m = 2,3,7, and 9. 
7. Compute the 100-adic representation of 783,614,955. 


8. Prove that n is even, then n? is divisible by 4. 


13. 


14. 


15. 
16. 


17. 


18. 


19. 


20. 


1. Divisibility and Primes 


. Prove that n is odd, then n? — 1 is divisible by 8. 
. Prove that n® — n is divisible by 6 for every integer n. 


. Prove that if d divides a, then d* divides a* for every positive integer 


k. 


. Prove that if d divides a and d divides b, then d divides ax + by for 


all integers x and y. 


Prove that if a and d are integers such that d divides a and |a| < d, 
then a = 0. 


Prove that divisibility is transitive, that is, if a divides b and b divides 
c, then a divides c. 


Prove by induction that n < 2”~! for all positive integers n. 
Prove by induction that 


n(n + 1) 
2 


for all positive integers n. 
Prove by induction that 
1949934 ...4n3 =(142+4---4+n) 


for all positive integers n, that is, the sum of the cubes of the first n 
integers is equal to the square of the sum of the first n integers. 


Prove that the principle of mathematical induction is equivalent to 
the minimum principle. 


Let a and d be integers with d > 1. Prove that there exist unique 
integers q’ and r’ such that 

a=dd +r’ 
and 


-i<r< 


Q 
wNla 


For integers n and k with n > 1 and 0 < k < n, we define the binomial 


coefficient (" (n—1)---(n—k+1) 


k} kl 
Define (3) = 1. Prove that for all n > 1, 


(=) 


2). 


22. 


23. 


24. 


25. 
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(= (Ce) + Ga) 


Prove that the product of any k consecutive integers is always divis- 
ible by kl. 


Hint: Use induction on n to show that Cy is an integer. 


and 


forl<k<n-1. 


Let mg,™m1,M2,... be a strictly increasing sequence of positive inte- 
gers such that mp = 1 and m, divides m;+, for all i > 0. Prove that 
every positive integer n can be represented uniquely in the form 


foe) 
r= y axMi, 
1=0 


where 


O0<a<—'-1  foralli>0 
and m, = 0 for all but finitely many integers 2. 


Prove that every positive integer n can be represented uniquely in 
the form 


n= oa ark, 
k=0 
where 
0 < ak < k 


Prove that every positive integer n can be uniquely represented in 
the form 


i= ba by8 se ba8? oe bp? 3", 


where b; € {0,1,—1} for i=0,1,2,...,k—1. 


Let N* denote the set of all k-tuples of positive integers. We define the 
lexicographic order on N* as follows. For (a1,...,a@x),(b1,--., 0%) € 
N*, we write 

(a1,.--,@) Ss (b1,..-, dg) 


if either a; = 6; for alli = 1,...,k, or there exists an integer 7 such 
that a; = 6; for i <j and a; < 6;. Prove that 


(a) The relation = is reflexive in the sense that if (a1,...,ax) 
(b1,...,b~) and (b1,...,b%) X (a1,..-,@x%), then (a1,...,a%) 
(b1,.--, bx). 


WTA 
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(b) The relation ~ is transitive in the sense that if (a1,...,a%) X 
(b1,..., 0%) and (b1,...,0%) X (c1,..-, CK), then (a1,...,a%) X 
(c1, waey Ck). 


(c) The relation ~ is totalin the sense that if (a1,...,a%), (b1,.--, 0x) 
N*, them (@14>+<; 04) S (Gay ss~, 0g )-0F iy es ose) (iy. 6-50): 


A relation that is reflexive and transitive is called a partial order. 

A partial order that is total is called a total order. Thus, the lex- 
icographic order is a total order on the set of k-tuples of positive 
integers. 


26. Prove that N* with the lexicographic order satisfies the following 
minimum principle: Every nonempty set of k-tuples of positive inte- 
gers contains a smallest element. 


1.2 Greatest Common Divisors 


Algebra is a natural language to describe many results in elementary num- 
ber theory. 

Let G be a nonempty set, and let G x G denote the set of all ordered 
pairs (2, y) with x,y € G. A binary operation on G is a map from G x G 
into G. We denote the image of (a,y) € Gx Gby x yeEG. 

A group is a set G with a binary operation that satisfies the following 
three axioms: 


(i) Associativity: For all x,y, z € G, 
(wxy) kz =xu*(y*z). 


(ii) Identity element: There exists an element e € G such that for all 
reEeG, 
exn=uxe=Z. 


The element e is called the identity of the group. 
(iii) Inverses: For every x € G there exists an element y € G such that 
ueY=Y*U=eE, 
The element y is called the inverse of x. 


The group G is called abelian or commutative if the binary operation 
also satisfies the axiom 


(iv) Commutativity: For all z,y € G, 


LEY =Y*.. 
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We can use additive notation and denote the image of the ordered pair 
(1,y)€ Gx Gby e+ y. We call «+ y the sum of « and y. In an additive 
group, the identity is usually written 0, the inverse of x is written —a, and 
we define « — y = x + (—y). We can also use multiplicative notation and 
denote the image of the ordered pair (x,y) € G x G by xy. We call xy the 
product of « and y. In a multiplicative group, the identity is usually written 
1 and the inverse of x is written 2~!. 

Examples of abelian groups are the integers Z, the rational numbers Q, 
the real numbers R, and the complex numbers C, with the usual operation 
of addition. The nonzero rational, real, and complex numbers, denoted 
by Q*,R%*, and C”, respectively, are also abelian groups, with the usual 
multiplication as the binary operation. For every positive integer m, the 
set of complex numbers 


Tm = {e27*/™ : k =0,1,...,m—1} 


is a multiplicative group. The elements of T,,, are called mth roots of unity, 
since w™ = 1 for all w € T,,. An example of a nonabelian group is the set 
GL2(C) of 2 x 2 matrices with complex coefficients and nonzero determi- 
nant, and with the usual matrix multiplication as the binary operation. 

A subgroup of a group G is a nonempty subset of G that is also a group 
under the same binary operation as G. If H is a subgroup of G, then H is 
closed under the binary operation in G, H contains the identity element of 
G, and the inverse of every element of H belongs to H. For example, the 
set of even integers is a subgroup of Z. A nonempty subset H of an additive 
abelian group G is a subgroup if and only if «—y € A for allz,y ¢ A 
(Exercise 20). 

For every integer d, the set of all multiples of d is a subgroup of Z. We 
denote this subgroup by dZ. If a1,...,a,% € Z, then the set of all numbers 
of the form a,x, +--+: + axa, with x1,...,x2% € Z is also a subgroup of Z. 
The set Q of rational numbers is a subgroup of the additive group R. The 
set R* of positive real numbers is a subgroup of the multiplicative group 
R*. Let T = {z € C: |z| = 1} denote the set of complex numbers of 
absolute value 1, that is, the unit circle in the complex plane. Then T is a 
subgroup of the multiplicative group C*%, and I’, is a subgroup of T. 

If G is a group, written multiplicatively, and g € G, then g” € G for all 
n € Z (Exercise 21), and {g” : n € Z} is a subgroup of G. 

The intersection of a family of subgroups of a group G is a subgroup of G 
(Exercise 22). Let S' be a subset of a group G. The subgroup of G generated 
by S is the smallest subgroup of G that contains S. This is simply the 
intersection of all subgroups of G that contain S' (Exercise 23). For example, 
the subgroup of Z generated by the set {d} is dZ. 


Theorem 1.3 Let H be a subgroup of the integers under addition. There 
exists a unique nonnegative integer d such that H is the set of all multiples 
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of d, that is, 


H = {0,+d,+2d,...} = dZ. 


Proof. We have 0 € H for every subgroup H. If H = {0} is the zero 
subgroup, then we choose d = 0 and H = 0Z. Moreover, d = 0 is the unique 
generator of this subgroup. 

If H 4 {0}, then there exists a € H,a 4 0. Since —a also belongs to H, 
it follows that H contains positive integers. By the minimum principle, H 
contains a least positive integer d. By Exercise 21, dq € H for every integer 
q, and so dZ C H. 

Let a € H. By the division algorithm, we can write a = dq +7, where q 
and r are integers and 0 < r < d—1. Since dq € H and H is closed under 
subtraction, it follows that 


r=a-—dq€ H. 


Since 0 < r < dand d is the smallest positive integer in H, we must have 
r= 0, that is, a= dq € dZ and H C dZ. It follows that H = dZ. 

If H = dZ = dZ, where d and d’ are positive integers, then d’ € dZ 
implies that d’ = dq for some integer g, and d € d’Z implies that d = d’q’ 
for some integer q’. Therefore, 


d=d'q =dqq, 


and so qq’ = 1, hence g = q’ = +1 and d = <td’. Since d and d’ are positive, 
we have d = d’, and d is the unique positive integer that generates the 
subgroup H. 


For example, if H is the subgroup consisting of all integers of the form 
35a + 91y, then 7 = 35(—5) + 91(2) € H and H =7Z. 

Let A be a nonempty set of integers, not all 0. If the integer d divides a 
for all a € A, then d is called a common divisor of A. For example, 1 is a 
common divisor of every nonempty set of integers. The positive integer d 
is called the greatest common divisor of the set A, denoted by d = gcd(A), 
if d is a common divisor of A and every common divisor of A divides d. 
We shall prove that every nonempty set of integers has a greatest common 
divisor. 


Theorem 1.4 Let A be a nonempty set of integers, not all zero. Then A 
has a unique greatest common divisor, and there exist integers ay,...,Qzn € 
A and x1,...,X,% such that 


gcd(A) = aya, +--+ + aga. 
Proof. Let H be the subset of Z consisting of all integers of the form 


a1, +--+ + appre with a,,...,a, € Aand 2,..., 2, € Z. 
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Then H is a subgroup of Z and A C H. By Theorem 1.3, there exists 
a unique positive integer d such that H = dZ, that is, H consists of all 
multiples of d. In particular, every integer a € A is a multiple of d, and so d 
is a common divisor of A. Since d € H, there exist integers a1,...,a, € A 
and x1,...,2, such that 


d= a ,%, +++: + app. 


It follows that every common divisor of A must divide d, hence d is a 
greatest common divisor of A. 

If the positive integers d and d’ are both greatest common divisors, then 
d|d’ and d’|d, and so d = d’. It follows that gcd(A) is unique. 


If A = {aj,...,ax} is a nonempty, finite set of integers, not all 0, we 
write gcd(A) = (a,,...,@,). For example, 


(35,91) = 7 = 35(—5) + 91(2). 


Theorem 1.5 Let ai,...,a, be integers, not all zero. Then (a1,...,@%) = 
1 if and only if there exist integers 171,...,x2% such that 


Qt, +--+ + apr, = 1. 


Proof. This follows immediately from Theorem 1.4. 


The integers a,,...,@, are called relatively prime if their greatest com- 
mon divisor is 1, that is, (a1,...,@,) = 1. The integers a1,..., a, are called 
pairwise relatively prime if (a;,a;) = 1 for i # 7. For example, the three in- 
tegers 6, 10,15 are relatively prime but not pairwise relatively prime, since 
(6,10, 15) = 1 but (6,10) = 2, (6,15) = 3, and (10,15) = 5. 

Let G and H be groups, and denote the group operations by *. A map 
f :G— GF is called a group homomorphism if f(a * y) = f(x) * f(y) for 
all x,y € G. Thus, a homomorphism f from an additive group G into a 
multiplicative group H is a map such that f(z +y) = f(x) f(y) for all 
x,y € G. For example, if R is the additive group of real numbers and Rt 
is the multiplicative group of positive real numbers, then the exponential 
map exp: R — R* defined by exp(x) = e” is a homomorphism. 

A group homomorphism f : G — H is called an isomorphism if f is 
one-to-one and onto. Groups G and H are called isomorphic, denoted by 
G = H, if there exists an isomorphism between them. For example, let 2Z 
denote the additive group of even integers. The map f : Z — 2Z defined 
by f(n) = 2n is an isomorphism between the group of integers and the 
subgroup of even integers. 


14 1. Divisibility and Primes 
Exercises 
1. Compute (935, 1122). 
2. Compute (168, 252, 294). 
3. Find integers x and y such that 13a + 1l5y =1. 
4. Construct four relatively prime integers a,b,c,d such that no three 
of them are relatively prime. 
5. Prove that (n,n +2) =1 is n is odd and (n,n + 2) = 2 is n is even. 
6. Prove that 2n+5 and 3n+7 are relatively prime for every integer n. 
7. Prove that 3n+2 and 5n+3 are relatively prime for every integer n. 
8. Prove that n!+1 and (n+1)!+1 are relatively prime for every positive 
integer n. 
9. Let a,b, and d be positive integers. Prove that if (a,b) = 1 and d 
divides a, then (d,b) = 1. 
10. Let a and b be positive integers. Prove that (a,b) = a if and only if a 
divides b. 
11. Let a,b,c be positive integers. Prove that 
(ac, bc) = (a, b)e. 
12. Let a,b, and c be positive integers. Prove that 
((a, 6), c) = (a, (b, €)) = (a, 6, ¢). 
13. Let A be a nonempty set of integers. Prove that the greatest common 
divisor of A is the largest integer that divides every element of A. 
14. Let a,b,c,d be integers such that ad — bc = 1. For integers u and v, 


define 


wu = aut by 


and 


uv =cut+dv. 
Prove that (u,v) = (u’,v’). 


Hint: Express u and v in terms of u’ and v’. 
PD 


15. 


16. 


17. 


18. 


19. 
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Let S = Q’t! \ {(0,0,...,0)} denote the set of all nonzero (n + 1)- 
tuples of rational numbers. If t is a nonzero rational number and 


(%0,1,---,;%n) € S, then we define 

t(ao,21,---,@n) = (tao, ta1,..., tan) € S. 
We introduce a relation ~ on S as follows: If (ao, 21,...,2) and 
(Yo; Y1;---;Yn) are in S, then (x%9,%1,.--,2n) ~ (Yo, Y1,---5Yn) if 
there exists a nonzero rational number ¢ such that t(%o,21,-..,2n) = 
(Yo; ¥1;---;Yn)- Prove that this is an equivalence relation, that is, 


prove that ~ is reflexive (2 ~ x for all  € S), symmetric (if x ~ y, 
then y ~ x), and transitive (if x ~ y and y ~ z, then x ~ z). The set 
of equivalence classes of this relation is called n-dimensional projec- 
tive space over the field of rational numbers, and denoted by P”(Q). 


Consider (2, —5, 42) € Q3. Find all triples (ao, a1, a2) of relatively 
prime integers such that 


25 10 
el ena ea 
(ao, 41, 42) ( 6 ’ ’ 3 ) 


(@0,%1,---+)2n) ES= Q”*! \ {(0,0,...,0)}. 


Let [(a0,#1,---,;%n)] denote the equivalence class of (%o,21,..-,2n) 
in P”(Q). Prove that there exist exactly two elements (ao, @1,..-, Gn) 
and (bo, b1,..-,0n) in S such that the numbers ao, a1,...,@n are rel- 
atively prime integers, the numbers Oo, b1,..., 5, are relatively prime 
integers, and 


Let 


(x0, 21, wise ,Ln)| = [(ao, @1, fats ,On)] — [(bo, b1,. ‘ .,0n)| E P”(Q). 


Moreover, 
(bo, bi, eh , On) = —(ao, Q1,--- Gn). 


Prove that the set of all rational numbers of the form a/2*, where 
a€ Zand k € Ng, is an additive subgroup of Q. 


Let G = {2Z,1+2Z}, where 2Z denotes the set of even integers and 
1+ 2Z the set of odd integers. Define addition of elements of G by 


2Z +2Z = (14+2Z) +(14+2Z) =2Z 


and 
2Z4+ (1+ 2Z) = (14 2Z)+2Z=14 22. 


Prove that G is an additive abelian group. 
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20. 


21. 


22. 


23. 


24. 
25. 


26. 


27. 


28. 
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Let H be a nonempty subset of an additive abelian group G. Prove 
that H is a subgroup if and only if « — y € A for all x,y € H. 


Prove that if G is a group, written multiplicatively, and g € G, then 
g” € G for all n € Z. (If G is an additive group, then ng € G for all 
néZ.) 


Prove that the intersection of a family of subgroups of a group G is 
a subgroup of G. 


Let S be a nonempty subset of an additive abelian group G. Prove 
that the subgroup of G generated by S' is the intersection of all sub- 
groups of G that contain S. 


Prove that every nonzero subgroup of Z is isomorphic to Z. 


Let G be the set of all matrices of the form 


(0 1): 


with a € Z and matrix multiplication as the binary operation. Prove 
that G is an abelian group isomorphic to Z. 


Let H3(Z) be the set of all matrices of the form 
1 a 
0 1 
0 0 


with a,b,c € Z and matrix multiplication as the binary operation. 
Prove that H3(Z) is a nonabelian group. This group is called the 
Heisenberg group. 


Let R be the additive group of real numbers and Rt the multi- 
plicative group of positive real numbers. Let exp : R — Rt be the 
exponential map exp(x) = e”. Prove that the exponential map is a 
group isomorphism. 


Let G and H be groups with e the identity in H. Let f: G— A be 
a group homomorphism. The kernel of f is the set 


fol(e) ={2 eG: f(z) =e H} CG. 
The image of f is the set 
f(G) ={f(w) :a€ G} CH. 


Prove that the kernel of f is a subgroup of G, and the image of f is 
a subgroup of H. 
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29. Define the map f : Z — Z by f(n) = 3n. Prove that f is a group 
homomorphism and determine the kernel and image of f. 


30. Let I, denote the multiplicative group of mth roots of unity. Prove 
that the map f : Z — Im defined by f(k) = e?7*/™ is a group 
homomorphism. What is the kernel of this homomorphism? 


31. Let G = [0,1) be the interval of real numbers x such that 0 < x < 1. 
We define a binary operation x *« y for numbers x, y € G as follows: 


_f vty ife+y<l, 
vey={ xe+ty-1 ifea@ty>l. 


Prove that G is an abelian group with this operation. This group is 
denoted by R/Z. 


Define the map f : R — R/Z by f(t) = {t}, where {t} denotes the 
fractional part of t. Prove that f is a group homomorphism. What is 
the kernel of this homomorphism? 


1.3 The Euclidean Algorithm and Continued 
Fractions 


Let a and b be integers with b > 1. There is a simple and efficient method 
to compute the greatest common divisor of a and b and to express (a, b) 
explicitly in the form ax + by. Define ro = a and r; = b. By the division 
algorithm, there exist integers gg and r2 such that 


ro=T190 + T2 


and 
O<rg<1ry. 


If an integer d divides ro and r;, then d also divides r, and rg. Similarly, 
if an integer d divides r; and re, then d also divides rg and r;. Therefore, 
the set of common divisors of rg and 17; is the same as the set of common 
divisors of ry and rz, and so 


(a, b) = (70,71) = (11,12). 


If ro = 0, then a = bgp and (a,b) = b =r 1. If ro > 0, then we divide rz into 
r, and obtain integers gq, and r3 such that 


m1 =7reqd +73, 


where 
0 <73 << 712 <1, 
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and 

(a,b) = (r1,r2) = (72,73). 
Moreover, gi > 1 since rg <r. If r3 = 0, then (a,b) = ra. If rz > 0, then 
there exist integers q2 and r4 such that 

rg = 1342 +14, 
where qo > 1 and 
O<rmg<1rg<T2 <r, 

and 

(a, b) am (r2, 73) = (r3, 14). 


If rq = 0, then (a,b) = 73. 

Iterating this process & times, we obtain an integer qg, a sequence of 
positive integers q1,q2,---,Qk—-1, and a strictly decreasing sequence of non- 
negative integers 71,7T2,...,;7T +1 Such that 


Ti-1 = MiGi-1 Ti+ 
fori =1,2,...,k, and 
(a,b) = (r0,71) = (71, 72) = +++ = (Tk, TR41)- 
If rp41 > 0, then we can divide rz by rp41 and obtain 


Tk = Tk4+19k + Tk+2), 


where 0 < rpyeo < Te41. Since a strictly decreasing sequence of nonnegative 
integers must be finite, it follows that there exists an integer n > 1 such 
that rn41 = 0. Then we have an integer go, a sequence of positive inte- 
gers 1, G2,---;Qn—1, and a strictly decreasing sequence of positive integers 
11,72,-+-,Tn with 

(a,b) = (Tn, Tn41) =Tn- 


The n applications of the division algorithm produce n equations 


To = 1190 + T2 
ry = red +73 
T2 = 13427 Ta 
T—-2 = Tn-19n-2 +n 


Tn-1 = Tn n-1- 


Since rn < Tn41, it follows that gn_1 > 2. 
This procedure is called the Fuclidean algorithm. We call n the length 
of the Euclidean algorithm for a and b. This is the number of divisions 
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required to find the greatest common divisor. The sequence qo, q1,---;Qn—1 
is called the sequence of partial quotients. The sequence r2,73,...,1%n is 
called the sequence of remainders. 

Let us use the Euclidean algorithm to find (574,252) and express it as a 
linear combination of 574 and 252. We have 


574. = 252-2470, 
252 = 70-3+42, 
70 = 42-1428, 
AQ = 28-1414, 
2% = 14-2, 


and so 
(574, 252) = 14. 


The sequence of partial quotients is (2,3, 1, 1,2) and the sequence of partial 
remainders is (70, 42, 28,14). The Euclidean algorithm for 574 and 252 has 
length 5. Note that 574 = 14-41 and 252 = 14-18, and that 41 and 18 are 
relatively prime. Working backwards through the Euclidean algorithm to 
express 14 as a linear combination of 574 and 252, we obtain 

14 = 42—28-1 

= 42—(70—42-1)-1=42-2-—70-1 

(252 — 70-3)-2—70-1= 252-2—70-7 
252-2 — (574 — 252-2)-7 = 252-16 —574-7. 


I 


Let ao,a1,...,a@y be real numbers with a; > 0 for i = 1,...,N. We 
define the finite simple continued fraction 
1 
(do, @1,---,@N) = ao T 


Another notation for a continued fraction is 
1 1 1 


ai+ ag+ an 


(a9,@1,--.,@N) = a9 


The numbers ao, a1,...,ay are called the partial quotients of the continued 
fraction. For example, 
1 13 
(2,1,1,2) =2+——,- = —. 
1+ ae 5 
2 


We can write a finite simple continued fraction as a rational function in 
the variables ag,a,,...,ay. For example, 


(a0) = ao, 
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aga, +1 
(ao, @1) me 
ay 
and 
aja, ag + Ag + ag 
(ag, @1, 42) = 
ajag+1 
If N > 1, then (Exercise 5) 
1 
(ao, @1,---,4N) = a + ————_... 
(a1, seey an) 


We can use the Euclidean algorithm to write a rational number as a finite 
simple continued fraction with integral partial quotients. For example, to 
represent 574/274, we have 


5740 9 70 
252 252 
1 
— 2 Sa 7} 
3+ 7 
1 
— 2 + rT 
3+ 7 
42 
1 
= 2 + T 
3+ 5 
i+ 33 
1 
— 2 + T 
3+ 73 
its 
= (2,3, 1,152). 
Notice that the partial quotients in the Euclidean algorithm are the partial 


quotients in the continued fraction. 


Theorem 1.6 Let a and b be integers with b > 1. If the Euclidean algo- 
rithm fora and b has length n with sequence of partial quotients qo, q1,---;Qn—1; 
then 


a 


db _ Ldosdi pees Paes 


Proof. Let ro = a and r; = b. The proof is by induction on n. If n = 1, 
then 


ro = 1140 
and is 3 hes 
ba er 
If n = 2, then 
ro = T1G0 +12, 


rr = 12q1, 
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and 


TO T2 1 1 
7™ = 404 = (qo; 41): 


5 1 


Let n > 2, and assume that the theorem is true for integers a and b > 1 
whose Euclidean algorithm has length n. Let a and b > 1 be integers 
whose Euclidean algorithm has length n+ 1 and whose sequence of partial 
quotients is (go, q1,---;Qn). Let 


To = 140 7 1T2 
Tr = req +73 

Tn-1 = Tndn-1 + Trtl 
Tn = Tn4+19n- 


be the n + 1 equations in the Euclidean algorithm for a = rp and b = rj. 
The Euclidean algorithm for the positive integers r; and rg has length n 
with sequence of partial quotients qi,...,@n- It follows from the induction 
hypothesis that 


Tr 
mo (dies ,In) 
and so 
a To 1 
po Ota Ot egy oe) 


This completes the proof. 


It is also true that the representation of a rational number as a finite 
simple continued fraction is essentially unique (Exercise 8). 


Exercises 


1. Use the Euclidean algorithm to compute the greatest common divisor 
of 35 and 91, and to express (35,91) as a linear combination of 35 
and 91. Compute the simple continued fraction for 91/35. 


2. Use the Euclidean algorithm to write the greatest common divisor of 
4534 and 1876 as a linear combination of 4534 and 1876. Compute 
the simple continued fraction for 4534/1876. 


3. Use the Euclidean algorithm to compute the greatest common divisor 
of 1197 and 14280, and to express (1197, 14280) as a linear combina- 
tion of 1197 and 14280. 
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Compute the simple continued fraction (2,1,2,1,1,4) to 4 decimal 
places, and compare this number to e. 


Prove that 


1 
(ao, @1,---,@N) = @ + —————_.. 
(a1, seey an) 
Let N > 1. Prove that 
(ao, a1, ..+,Q@N—2,Q4N-1, 1) = (ao, Q1,..-,@N—2,4N-1 + 1). 
Let x = (ao,4@1,...,@n) be a finite simple continued fraction whose 


partial quotients a; are integers, with N > 1 and ay > 2. Let [2] 
denote the integer part of x and {x} the fractional part of x. Prove 
that 

[] = ao 


and 
1 


(a1,---,Q@N)_ 


{x} = 


Let § be a rational number that is not an integer. Prove that there 
exist unique integers ap, a ,...,@y such that a; > 1 fori =1,...,N- 
1, an > 2, and 


a 
3 = (a9, @1,...,@N—1,4N). 
Hint: By Exercise 7, if 
t= (ao, @1,---,@Nn) = (bo, b1,---, bar) 


with a;,b; € Z and ay, bas > 2, then ag = [2] = bo. 


. Prove that 
1 
(do, 01,---,@N,@n41) = (@9,@1,---,€@nN + ). 
QN+1 
Let (ao,@1,...,@n) be a finite simple continued fraction. Define 
Po = 40, 
Pi = 4149 +1, 

and 

Pn = AnPn—1 + Pn—2 for n = 2,...,N. 
Define 


go = 1, 


q1 = 41, 


11. 


12. 


13. 


14. 


15. 
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and 
dn = An Gn—1 + In—2 for n = 2,...,N. 
Prove that 
(ao, @1,---,@n) = BaD 
Qn 
for n = 0,1,...,N. The continued fraction (ao, a1,...,@n) is called 
the nth convergent of the continued fraction (ao, a1,...,@N). 


Compute the convergents pn/qn of the simple continued fraction 
(1,2,2,2,2,2,2). Compute pg/gqeg to 5 decimal places, and compare 
this number to V2. 


Let (ao,@1,-.-,@n) be a finite simple continued fraction, and let py, 
and gq, be the numbers defined in Exercise 10. Prove that 

Pn4n-1 — Pn-194n = (-1)""? 
and for n = 1,...,N. Prove that if a; € Z for i = 0,1,...,N, then 
(Pn; Qn) = 1 forn=0,1,...,N. 


Let (ao, @1,-.-,@n) be a finite simple continued fraction, and let p, 
and q, be the numbers defined in Exercise 10. Prove that 


n 


PnQn-2 — Pn-249n = (=) an 
for n= 2,...,N. 


Let « = (ao,a1,...,@n) be a finite simple continued fraction, and 
let pp and gq, be the numbers defined in Exercise 10. Prove that 
the even convergents are strictly increasing, the odd convergents are 
strictly decreasing, and every even convergent is less than every odd 
convergent, that is, 

Po _ p2 _ Pa Ps _ P3 _ Pi 


— << <-<@de Hc SH Kc. 
qo q2 g4 q5 93 1 


We define a sequence of integers as follows: 


fo = 0, 
fi = 1, 
fn = fn—-1+ fr—2 for n > 2. 


The integer f, is called the nth Fibonacci number. Compute the Fi- 
bonacci numbers f,, for n = 2,3,...,12. Prove that (fn, fn41) = 1 
for all nonnegative integers n. 


In Exercises 16-23, f,, denotes the nth Fibonacci number. 
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16. Compute the convergents p,/q, of the simple continued fraction 
(1,1,1,1,1,1,1). Observe that 


y] 9 y 9 > ’ 


Pn = fn41 
dn fn 


for n = 0,1,...,6. 
17. Prove that 
fit fote-+ + fn = fr42-1 


for all positive integers n. 


18. Prove that 
fasifn—1 — fh = (-1)” 


for all positive integers n. 


19. Prove that 
fn = frtifn—k + fefn—r-1 


for all k =0,1,...,n. Equivalently, 


fn fn—-1 + fn—2 = 2fn—2 + fn—3 


3fin 3+2fn 4=5fn 4a+3fn Der oss 


I 


I 


20. Prove that f,, divides fe, for all positive integers 2. 


21. Prove that, for n > 1, 


(4 (8) 
fn dad - 1 0 : 


22. Let 
1+75 
a= 
2 
and J 
1-—¥V5 
p=-—. 
Prove that n_ gn 
a” — 
= for all n > 0. 
fer: 
Prove that e 
fn ~ as as N — OO 
and 


{20° for a2, 
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23. (Lamé’s theorem) Let a and b be positive integers with a > b. The 
length of the Euclidean algorithm for a and b, denoted by E(a,b), is 
the number of divisions required to find the greatest common divisor 
of a and b. Prove that 


& log b 


as rg 
log a 


E(a,b) 


where a = (1+ V5)/2. 
Hint: Let n = E(a,b). Set ro = a and r; = b. For i =1,...,n, let 


Ti-1 = TeGi-1 + Ti41; 


where the positive integers qo, q1,---;@n—1 are the partial quotients 
and r9,...,T—1,Tn are the remainders in the Euclidean algorithm. 
Then 


To >T1 >t > fn-1 > tn 21 
and (a, 6) = (79,71) =n. Let fn be the nth Fibonacci number. Since 
T 21= fo and rn_; > 2 = fs, it follows that 
Tr-2 = Tn-194n-2 +1 n 2 fa + fo = fa, 
Tn-3 = Tn-29n-3 +Tn-1 = fa + fs = fs; 


and, by induction on k, 
Tn—-k = Se+2 
for k = 0,1,...,n. In particular, 


b=r1> fari 2a. 


1.4. The Fundamental Theorem of Arithmetic 


A prime number is an integer p greater than 1 whose only positive divisors 
are 1 and p. A positive integer greater than 1 that is not prime is called 
composite. If n is composite, then it has a divisor d such that 1 <<d <n, 
and so n = dd’, where also 1 < d’ < n. The primes less than 100 are the 
following: 
2 3 5 7 Ii 
13 17 19 23 29 
31 37 41 43 = «47 
53 59 61 67 71 
73. 79 83 89 97. 


If dis a positive divisor of n, then d’ = n/d is called the conjugate divisor 
to d. Ifn =dd' and d < d’, then d < \/n. 
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We shall prove that every positive integer can be written as the product 
of prime numbers (with the convention that the empty product is equal to 
1), and that this representation is unique except for the order in which the 
prime factors are written. This result is called the fundamental theorem of 
arithmetic. 


Theorem 1.7 (Euclid’s lemma) Let a,b,c be integers. If a divides be 
and (a,b) =1, then a divides c. 


Proof. Since a divides bc, we have bc = aq for some integer q. Since a 
and 0 are relatively prime, Theorem 1.5 implies that there exist integers x 
and y such that 

1=az + by. 


Multiplying by c, we obtain 


c= acx + bey = acx + aqy = a(cx + qy), 


and so a divides c. This completes the proof. 


Theorem 1.8 Let k > 2, and let a,b, be,...,b% be integers. If (a,b;) =1 
for alli =1,...,k, then (a, b1b2--- by) = 1. 


Proof. The proof is by induction on k. Let k = 2 and d = (a,b1b2). We 
must show that d = 1. Since d divides a and (a,b,) = 1, it follows that 
(d, b,) = 1. Since d divides 6,b2, Euclid’s lemma implies that d divides bo. 
Therefore, d is a common divisor of a and bg, but (a, b2) = 1 and sod=1. 

Let k > 3, and assume that the result holds for k — 1. Let a,bi,..., bp 
be integers such that (a,b;) = 1 fori =1,...,k. The induction assumption 
implies that (a, b1---bp-1) = 1. Since we also have (a, b,x) = 1, it follows 
from the case k = 2 that (a,b, ---b,-1b,) = 1. This completes the proof. 


Theorem 1.9 Jf a prime number p divides a product of integers, then p 
divides one of the factors. 


Proof. Let b;, b2,...,b, be integers such that p divides b; --- by. By The- 
orem 1.8, we have (p,b;) > 1 for some i. Since p is prime, it follows that p 
divides ;. 


Theorem 1.10 (Fundamental theorem of arithmetic) Every positive 
integer can be written uniquely (up to order) as the product of prime num- 
bers. 
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Proof. First we prove that every positive integer can be written as a 
product of primes. Since an empty product is equal to 1, we can write 1 
as the empty product of primes. Let n > 2. Suppose that every positive 
integer less than n is a product of primes. If n is prime, we are done. If 
n is composite, then n = dd’, where 1 < d < d’ < n. By the induction 
hypothesis, d and d’ are both products of primes, and so n = dd’ is a 
product of primes. 

Next we use induction to prove that this representation is unique. The 
representation of 1 as the product of the empty set of primes is unique. 
Let n > 2 and assume that the statement is true for all positive inte- 
gers less than n. We must show that if n = p1---pr = pi---pp, where 
P1,-++)Pk,P4,---, pp are primes, then k = @ and there is a permutation o 
of 1,...,& such that p; = P(3) fori =1,...,k. By Theorem 1.9, since pz 
divides pj ---p), there exists an integer jo € {1,...,¢} such that p;, divides 
Pijg> and so pz = pj, since p’,, is prime. Therefore, 


£ 


n 
mae a ae I] %<” 


j=l 

J#IO 

It follows from the induction hypothesis that k — 1 = @— 1, and there is 
a one-to-one map o from {1,...,4 — 1} into {1,...,k} \ {jo} such that 
Di= P03) fori =1,...,k—1. Let o(k) = jo. This defines the permutation 
go, and the proof is complete. O 


For any nonzero integer n and prime number p, we define v,(n) as the 
greatest integer r such that p” divides n. Then v,(n) is a nonnegative 
integer, and u,(n) > 1 if and only if p divides n. If v,(n) =r, then we say 
that the prime power p" exactly divides n, and write p"||n. The standard 
factorization of n is 

n= [x : 


p|n 


Since every positive integer is divisible by only a finite number of primes, 


we can also write 
n= [[>"™. 
Pp 


where the product is an infinite product over the set of all prime numbers, 
and v,(n) = 0 and p’?\™ = 1 for all but finitely many primes p. The 
function v,(n) is called the p-adic value of n. It is completely additive in 
the sense that v,(mn) = vp(m) + vp(n) for all positive integers m and n 
(Exercise 13). For example, since n! = 1-2-3---n, we have 
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The standard factorizations of the first 60 integers are 


1=1 21=3-7 41=41 
2=2 22=2-11 42=2-3-7 
3=3 23 = 23 43 = 43 
4= 2? 24 = 23.3 44 = 2?-11 
5=5 25 = 5? 45 = 37-5 
6=2-3 26 =2-13 46 = 2-23 
we 273% 47 = 47 
Sa: 28277 48 = 24.3 
9 = 3? 29 = 29 49 = 7? 
10=2-5 30 =2-3-5 50 = 2-5? 
11=11 31 = 31 51=3-17 
12 = 27.3 32 = 2° 52 = 27-13 
13 = 138 33 =3-11 53 = 53 
14=2-7 34 =2-17 54 = 2-33 
15=3-5 35 =5-7 55=5-11 
16 = 24 36 = 2? . 3? 56 = 23.7 
17=17 37 = 37 57 =3-19 
18 = 2-3? 38 = 2-19 58 = 2-29 
19 = 19 39 =3-13 59 = 59 

20 = 27-5 40 = 23-5 60 = 27-3-5. 

Let a1,...,@, be nonzero integers. An integer m’ is called a common 
multiple of a,,...,a, if it is a multiple of a; for all 7 = 1,...,k, that is, 
every integer a; divides m’. The least common multiple of a,,...,a% iS a 
positive integer m such that m is a common multiple of a,,...,a@,%, and m 


divides every common multiple of aj,.. 


.,@,. For example, 910 is a common 


multiple of 35 and 91, and 455 is the least common multiple. We shall show 
that there is a unique least common multiple for every finite set of nonzero 
., @x] the least common multiple of a1,..., ax. 


integers. We denote by [a1,.. 


Theorem 1.11 Let ay,.. 


., ax be positive integers. Then 


(a1,...,ay) = [] pinto endestolau} 
Pp 


and 


Proof. This follows immediately from the fundamental theorem of arith- 


metic. 


Let x be a real number. Recall that the integer part of x is the greatest 
integer not exceeding x, that is, the unique integer n such that n < 2 < 
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n+1. We denote the integer part of x by [a]. For example, [4] = 1h): 
and [-4] = —2. The fractional part of x is the real number 

{x} = x — [2] € [0,1). 


Thus, {2} = 3 and {—3} = 2, We can use the greatest integer function 


to compute the standard factorization of factorials. 


Theorem 1.12 For every positive integer n and prime p, 


Proof. Let 1 < m < n. If p” divides m, then p”" < m<nandr< 
log n/ log p. Since r is an integer, we have r < [log n/log p| and 


The number of positive integers not exceeding n that are divisible by p” is 
exactly [n/p"], and so 


volnt) = So ep(m)= > S01 
m=1 m=1 beri 

fas] a DSS] 
FET y 

r=1 d r=1 z 


This completes the proof. 


We shall use Theorem 1.12 to compute the standard factorization of 10!. 
The primes not exceeding 10 are 2,3,5, and 7, and 


oso) = [2] + [2] + [2] =sr241<8 
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Therefore, 
10 2°9"5-7. 


For every nonzero integer m, the radical of m, denoted by rad(m), is the 
product of the distinct primes that divide m, that is, 


For example, rad(15) = rad(—45) = rad(225) = 15 and rad(p") = p for p 
prime and r > 1. 


Theorem 1.13 Let m and a be nonzero integers. There exists a positive 
integer k such that m divides a® if and only if rad(m) divides rad(a). 


Proof. We know that m divides a* if and only if vp(m) < vp(a*) = 
kvu,(a) for every prime p (Exercise 14). If there exists an integer k such 
that m divides a*, then v,(a) > 0 whenever vp(m) > 0, and so every prime 
that divides m also divides a. This implies that rad(m) divides rad(a). 

Conversely, if rad(m) divides rad(a), then vp(a) > 0 for every prime p 
such that v,(m) > 0. Since only finitely many primes divide m, it follows 
that there exists a positive integer k such that vp(a*) = kup(a) > vp(m) 


for all primes p, and so m divides a”. 


Exercises 


1. Factor 51,948 into a product of primes. 
2. Factor 10* + 1 into a product of primes for k = 1,2,3,4,5. 


3. Find the greatest common divisor and least common multiple of a = 
223° 7419" and 6395511713; 


. Compute the least common multiple of the integers 1, 2,3,..., 15. 
. Compute the standard factorization of 15!. 


. Prove that n,n+2,n+ 4 are all primes if and only if n = 3. 


. Prove that n,n+4,n+ 8 are all primes if and only if n = 3. 


oN ODO ao Bw 


. Let n > 2. Prove that (n+ 1)!+ k is composite for k = 2,...,n+4+ 1. 
This shows that there exist arbitrarily long intervals of composite 
numbers. 


9. Prove that n° — n is divisible by 30 for every integer n. 


10. Find all primes p such that 29p + 1 is a square. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


ZL. 


22. 


23. 


24. 
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The prime numbers p and q are called twin primes if |p — q| = 2. Let 
p and q be primes. Prove that pq+ 1 is a square if and only if p and 
q are twin primes. 


Prove that if p and q are twin primes greater than 3, then p+ q is 
divisible by 12. 


Let m,n, and k be positive integers. Prove that 
Up(mn) = Up(m) + vp(n) and Up(m*) = kup(m). 


Let d and m be nonzero integers. Prove that d divides m if and only 
if vp(d) < vp(m) for all primes p. 


Let m = Tee where pj,...,px are distinct primes, k > 2, and 
r; >1fori=1,...,k. Let m; = mp; * fori =1,...,k. Prove that 
(my1,...,Mpz) = 1. 


Let a,b, and c be positive integers. Prove that (ab,c) = 1 if and only 
if (a,c) = (b,c) =1. 


Prove that if 6 divides m, then there exist integers b and c such that 
m = bc and 6 divides neither 6 nor c. 


Prove the following statement or construct a counterexample: If d is 
composite and d divides m, then there exist integers b and c such 
that m = bc and d divides neither 6 nor c. 


Let a and 6 be positive integers. Prove that (a, bc) = (a, 6)(a,c) for 
every positive integer c if and only if (a,b) = 1. 


Let mj ,..., mx be pairwise relatively prime positive integers, and let 
d divide m,---mx. Prove that for each 71 = 1,...,k there exists a 
unique divisor d; of m; such that d= d,--- dx. 


Let n > 2. Prove that the equation y” = 2x” has no solution in 
positive integers. 


Let n > 2, and let x be a rational number. Prove that ¢/z is rational 
if and only if « = y” for some rational number y. 


Let m1,...,m, be positive integers and m = [m1,...,m x]. Prove 
that there exist positive integers d,...,d, such that d; is a divisor 
of m,; fora = 1,...,k, (dj,d;) = 1 for 1 <1 <7 < n, and m= 
[di,..., dx] = di--- dx. 


Prove that for any positive integers a and b, 


ab 


[a, b] = (5): 


32 


25 


26. 


27. 


28. 


29. 


30. 


3l. 


32. 


33. 


34. 
35. 
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. Let a and b be positive integers with (a,b) = d. Prove that 


ea] -> 


Prove that for any positive integers a, b,c, 
abc(a, b, c) 
(a, b,c] = ——_——.. 
(a, b)(b, e)(¢, a) 
Let a1,...,@% be positive integers. Prove that [a1,...,a@,] = a1--+- Gr 
if and only if the integers a1,...,a, are pairwise relatively prime. 


Let a and b be positive integers and p a prime. Prove that if p divides 
[a,b] and p divides a + b, then p divides (a, b). 


Let a and b be positive integers such that 
a+b=57 
and 
[a, b] = 680. 
Find a and b. 


Hint: Show that a and 6 are relatively prime. Then a(57 — a) = ab = 
a, b]. 


Let aZ = {ax : x € Z} denote the set of all multiples of a. Prove that 
for any integers a1,...,@x, 


k 
() aiZ = [ay,..., an|Z. 
i=l 


A positive integer is called square-free if it is the product of dis- 
tinct prime numbers. Prove that every positive integer can be written 
uniquely as the product of a square and a square-free integer. 


Prove that the set of all rational numbers of the form a/b, where 
a,b € Z and 6 is square-free, is an additive subgroup of Q. 


A powerful number is a positive integer n such that if a prime p 
divides n, then p? divides n. Prove that every powerful number can 
be written as the product of a square and a cube. Construct examples 
to show that this representation of powerful numbers is not unique. 


Prove that m is square-free if and only if rad(m) = m. 


Prove that rad(mn) = rad(m)rad(n) if and only if (m,n) = 1. 
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36. Let H = {1,5,9,...} be the arithmetic progression of all positive 
integers of the form 44+ 1. Elements of H are called Hilbert numbers. 
Show that H is closed under multiplication, that is, z,y € H implies 
ay € H. Anelement x of H will be called a Hilbert prime if x 4 1 and 
x cannot be written as the product of two strictly smaller elements 
of H. Compute all the Hilbert primes up to 100. Prove that every 
element of H can be factored into a product of Hilbert primes, but 
that unique factorization does not hold in H. 


Hint: Find two essentially distinct factorizations of 441 into a product 
of Hilbert primes. 


37. For n > 1, consider the rational number 


1 1 1 
hyn =1+ it 5te-45, 
a ae 


Prove that hy, is not an integer for any n > 2. 


Hint: Let 2° be the largest power of 2 not exceeding n. Let P be the 
product of the odd positive integers not exceeding n. Consider the 
number 2°-!Phy. 


1.5 Euclid’s Theorem and the Sieve of 
Eratosthenes 


How many primes are there? The fundamental theorem of arithmetic tells 
us that every number is uniquely the product of primes, but it does not 
give us the number of primes. Euclid proved that the number of primes is 
infinite. The following proof is also due to Euclid. It has retained its power 
for more than two thousand years. 


Theorem 1.14 (Euclid’s theorem) There are infinitely many primes. 


Proof. Let p1,...,pn be any finite set of prime numbers. Consider the 
integer 
N=pi-+'Pr tl. 


Since N > 1, it follows from the fundamental theorem of arithmetic that N 
is divisible by some prime p. If p = p; for some i = 1,...,n, then p divides 
N — p,-+-pPn = 1, which is absurd. Therefore, p # p; for all i = 1,...,n. 
This means that, for any finite set of primes, there always exists a prime 
that does not belong to the set, and so the number of primes is infinite. 


Let a(a) denote the number of primes not exceeding x. Then a(x) = 0 
for x < 2, r(x) = 1 for 2 < x < 3, w(x) = 2 for 3 < & < 5, and so on. 
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Euclid’s theorem says that there are infinitely many prime numbers, that 
is, 


lim (x) = 0, 
@w— Co 


but it does not tell us how to determine them. We can compute all the 
prime numbers up to x by using a beautiful and efficient method called the 
sieve of Eratosthenes. The sieve is based on a simple observation. If the 
positive integer n is composite, then n can be written in the form n = dd’, 
where 1 <d<d' <n. Ifd> Wn, then 


n=dd' > J/nJ/n=n, 


which is absurd. Therefore, if n is composite, then n has a divisor d such 
that 1 < d< 4/n. In particular, every composite number n < z is divisible 
by a prime p < \/@. 

To find all the primes up to x, we write down the integers between 1 
and x, and eliminate numbers from the list according to the following rule: 
Cross out 1. The first number in the list that is not eliminated is 2; cross 
out all multiples of 2 that are greater than 2. The iterative procedure is as 
follows: Let d be the smallest number on the list whose multiples have not 
already been eliminated. If d < ./z, then cross out all multiples of d that 
are greater than d. If d > ./z, stop. This algorithm must terminate after 
at most ,/x steps. The prime numbers up to x are the numbers that have 
not been crossed out. 

We shall demonstrate this method to find the prime numbers up to 60. 
We must sieve out by the prime numbers less than 60, that is, by 2,3, 5, 
and 7. Here is the list of numbers up to 60: 


1 2 3 4 5 6 7 8 9 10 
11 12 13 14 15 16 17 #18 #19 20 
21 22 23 24 25 26 27 28 29 30 
31 32 33 34 35 36 37 38 39 40 
41 42 43 44 45 46 47 48 49 50 
51 52 53 54 55 56 57 58 59 60 


We cross out 1 and all multiples of 2 beginning with 4: 
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Next we cross out all multiples of 3 beginning with 6: 


A DB fe 8 OT BB. AO 
11 A2 13 A4 A5 AG 17 AB 19 20 
fl £2 23 ~4 25 26 27 ps8 29 ZO 
31 £2 23 B4 35 26 37 28 B9 AO 
41 A2 43 Ad AD AG 47 AB 49 BO 
Bl 62 53 B4 55 B6 AT B8 59 60 


Next we cross out all multiples of 5 beginning with 10: 


A 2 3 A 5 6B 7 B P AO 
11 A2 13 A4 5 AG 17 AB 19 20 
pl 22 23 ~4 25 26 27 ps8 29 ZO 
31 £2 23 B4 25 26 37 28 B9 AO 
41 A2 43 Ad AD AG 47 AB 49 BO 
Bl 62 53 B4 B65 B6 fT B8 59 60 


Finally, we cross out all multiples of 7 beginning with 14: 


A 2 3 A S& 6 f 2 Pp. f0 
11 A2 13 A4 A5 AG 17 AB 19 20 
pl £2 23 ~4 25 26 27 ps8 29 BO 
31 £2 23 B4 25 26 37 28 BO AO 
41 A2 43 Ad AD AG 47 AB AD BO 
Bl £2 53 64 65 B6 fT B8 59 60 


The numbers that have not been crossed out are: 
2,3, 5, 7,11, 18,17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59. 


These are the prime numbers up to 60. 


Exercises 


1. Use the sieve of Eratosthenes to find the prime numbers up to 210. 
Compute 7(210). 


2. Let N = 210. Prove that N — p is prime for every prime p such that 
N/2 < p < N. Find a prime number gq < N/2 such that N — q is 
composite. 


3. Let N = 105. Show that N — 2” is prime whenever 2 < 2” < N. This 
statement is also true for N = 7,15,21,45, and 75. It is not known 
whether N = 105 is the largest integer with this property. 


4. Let N = 199. Show that N — 2n? is prime whenever 2n? < N. It is 
not known whether N = 199 is the largest integer with this property. 
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5. Let a and n be positive integers. Prove that a” — 1 is prime only 


10. 
11. 


if a = 2 and n = p= is prime. Primes of the form M, = 2? — 1 
are called Mersenne primes. Compute the first five Mersenne primes. 
The largest known primes are Mersenne primes. It is an unsolved 
problem to determine whether there are infinitely many Mersenne 
primes. There is a list of all known Mersenne primes in the Notes at 
the end of this chapter. 


. Let k be a positive integer. Prove that if 2*+1 is prime, then k = 2”. 


The integer 

F, =2?° +1 
is called the nth Fermat number. Primes of the form 2?" +1 are called 
Fermat primes. Show that F,, is prime for n = 1, 2,3, 4. 


. Prove that Fs is divisible by 641, and so Fs is composite. 


Hint: Observe that 
Pea a 1S? 6 582) (54 2 1) 
and 
641 = 24-+54=5-27 +1, 
Prove that 641 divides both 5+ - 228 + 23? and 54 . 228 — 1. 


It is an unsolved problem to determine whether there are infinitely 
many Fermat primes. Indeed, we do not know whether F;,, is prime 
for any n > 4. 


. Modify the proof of Theorem 1.14 to prove that there are infinitely 


many prime numbers whose remainder is 3 when divided by 4. 


Hint: Let pi,po,.--,Pn be primes of the form 4k + 3, p; 4 3. Let 
N = 4p, p2--: pn +3. Show that N must be divisible by some prime 
q of the form 4k + 3. 


. Show that every prime number except 2 and 3 has a remainder of 1 


or 5 when divided by 6. Prove that there are infinitely many prime 
numbers whose remainder is 5 when divided by 6. 


Prove that a(n) < n/2 for n> 8. 


Prove that a(n) < n/3 for n > 33. 


Hint: Prove the following assertions. (i) If no > 3, then there are 
at most two primes among the 6 consecutive integers no + 1,n9 + 
2,...,N9 + 6. (ii) Suppose that no > 3 and a(no) < no/3. Let 
n = no + 6k for some positive integer k. Then a(n) < n/3. (iii) 
Show (by computation) that (32) > 32/3 but a(no) < no/3 for 
mo = 33,34,...,38. (iv) Show that every integer n > 33 can be 
written in the form no + 6k for some nonnegative integer k and 
Mo € {33, 34,..., 38}. 
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12. Let no > 6. Prove that if t(no) < 49/15 and n = no + 30k, then 
mn) < 4n/15. 

13. Let 2 =p, < po <--- be the sequence of primes in increasing order. 
Prove that 


gr-1 


Pn <2 
for alln > 1. 


Hint: Show that the method used to prove Euclid’s theorem (Theo- 
rem 1.14) also proves that ppii <pr---Pn tl. 


14. Let log, x denote the logarithm of x to the base 2. Prove that 
m(x) > logy logy x 
for all a > 1. 
Hint: Exercise 14. 


15. Let p1,...,px be a finite set of prime numbers. Prove that the num- 
ber of positive integers n < ax that can be written in the form 
n= p;'---p;,* is at most 


Prove that if x is sufficiently large, then there are positive integers 
n < x that cannot be represented in this way. Use this to give another 
proof that the number of primes is infinite. 


1.6 A Linear Diophantine Equation 


A diophantine equation is an equation of the form 
f(a1,---,%%) =b 


that we want to solve in rational numbers, integers, or nonnegative integers. 
This means that the values of the variables 21,...,2,% will be rationals, 
integers, or nonnegative integers. Usually the function f(a1,...,7%) is a 
polynomial with rational or integer coefficients. 

In this section we consider the linear diophantine equation 


12, +--+ + apap = bz 


We want to know when this equation has a solution in integers, and when 
it has a solution in nonnegative integers. For example, the equation 


3271 +5249 =D 
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has a solution in integers for every integer b, and a solution in nonnegative 
integers for b = 0,3,5,6, and all b > 8 (Exercise 1). 


Theorem 1.15 Let a,,...,a, be integers, not all zero. For any integer b, 
there exist integers x1,...,x, such that 

ayxyt+-::+apr, = 5 (1.4) 
if and only if b is a multiple of (a1,...,a%). In particular, the linear equa- 


tion (1.4) has a solution for every integer b if and only if the numbers 
a1,.--,@~ are relatively prime. 


Proof. Let d = (a,...,a,). If equation (1.4) is solvable in integers 2;, 
then d divides b since d divides each integer a;. Conversely, if d divides 
b, then b = dq for some integer g. By Theorem 1.4, there exist integers 
Y1,-+-,Yk Such that 

ayyi +--+: + apy, = d. 


Let x; = yiq fori =1,...,k. Then 
Qt, +++ + Are = a1(yig) +++> + ax(yRg) = dq = b 


is a solution of (1.4). It follows that (1.4) is solvable in integers for every b 
if and only if (a1,...,a%) = 1. 


Theorem 1.16 Let a,,...,a, be positive integers such that 
(a1,.--,@r) = 1. 
If 
k-1 
b> (ax 3 1) Gs 
i=l 
then there exist nonnegative integers ©1,...,X£% such that 
121 +--+ + apap = bz 
Proof. By Theorem 1.15, there exist integers z1,..., 2, such that 
QZ, +--+ +apzy = DB. 


Using the division algorithm, we can divide each of the integers z1,..., 2-1 
by ax so that 
7 


and 
O0< a; <a,—-—1 
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fori =1,...,k—1. Let 
k-1 
Lk = 2k + >> aig. 
i=1 


Then 


b = ayzy tees + Ap-12ZK-1 + AkZK 


a1(Geq1 + £1) + +++ + Ge—1(AnGe—-1 + Le—-1) + GnZe 


k-1 
= ty teee + Ak-1UK-1 + Ag | Ze + s Qidi 
i=1 


= ary step Ak-1Uk-1 1 ARLE 
k-— 

= (ae=4) S Qj + Ap&r, 
i=l 


where xx is an integer, possibly negative. However, if 


k-1 
b> (ax = Ly) oy 
i=l 


then a,x, > 0 and so x; > 0. This completes the proof. 


Let aj,...,a% be relatively prime positive integers. Since every suffi- 
ciently large integer can be written as a nonnegative integral linear combi- 
nation of a1,...,@x, it follows that there exists a smallest integer 


Gack , Ak) 


such that every integer b > G(a1,..., a) can be represented in the form (1.4), 
where the variables x1,..., 2, are nonnegative integers. The example above 
shows that 


G(3,5) =8. 


The linear diophantine problem of Frobenius is to determine G(aj,...,@x) 
for all finite sets of relatively prime positive integers a,,...,a,. This is a 
difficult open problem, but there are some special cases where the solution 
is known. The following theorem solves the Frobenius problem in the case 
k=2. 


Theorem 1.17 Let a, and az be relatively prime positive integers. Then 


G(a1, a2) = (ay as 1)(ag = 1). 
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Proof. We saw in the proof of Theorem 1.15 that for every integer b 
there exist integers x, and x2 such that 


b= 4,21 + aX and O<2, <ag-1. (1.5) 
If we have another representation 
b= ayr} + der), and O<2,<a.-1, 


then 
ay(a1 — 2) = ae(ag — 22). 

Since az divides a1(%1—24) and (a1, a2) = 1, Euclid’s lemma (Theorem 1.7) 
implies that ag divides 71 — x. Then 21 = x}, since |a1 — x4| < ag — 1. It 
follows that #2 = x5, and so the representation (1.5) is unique. 

If the integer b cannot be represented as a nonnegative integral combina- 
tion of a, and az, then we must have x, < —1 in the representation (1.5). 
This implies that 


b= a4 21 + agxe < ai(ag — 1) + ae(—1) = (a, — 1)(ag — 1) - 1, 
and so G(a1, a2) < (a; — 1)(ag — 1). On the other hand, since 
a1(az — 1) + ag(—1) = aya — ay — a2 < ayaa, 
it follows that if 
ayag — A, — AQ = 412%, + A2QX2 


for any nonnegative integers 7, and x2, then 0 < 2; < ag —1. By the 
uniqueness of the representation (1.5), we must have x1; = a2 — 1 and 
x2 = —1. Therefore, the integer ajaz — a; — a2 cannot be represented as a 
nonnegative integral linear combination of a, and az, and so G(a1, a2) = 
(ay _ 1)(a2g _ 1). 


Exercises 
1. Prove that the equation 
321 + 5x2 =b 


has a solution in integers for every integer b, and a solution in non- 
negative integers for b = 0,3,5,6 and all b > 8. 


2. Find all solutions in nonnegative integers 71 and x2 of the linear 
diophantine equation 
2241 + 7x2 = 53. 
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. Find all solutions in nonnegative integers x71 and x2 of the linear 
diophantine equation 


2821 + 352xq = 136. 


. Let a, and az be relatively prime positive integers. Let N(a1, a2) 
denote the number of nonnegative integers that cannot be represented 
in the form 


a,x, + aQ%X2 


with x1, x2 nonnegative integers. Compute N(3, 10) and N(3, 10)/G(3, 10). 
. Compute N(7,8) and N(7,8)/G(7,8). 
. Find all nonnegative integers that cannot be represented by the form 
321 + 10x + 1423 
with 71, 72,x3 nonnegative integers. Compute G(3, 10, 14). 


. Let a1 and az be relatively prime positive integers. Let M be the set of 
all integers n such that 0 < n < a,a2—a,—a2 and n can be written in 
the form n = a,21 + a2%2, where x; and x2 are nonnegative integers. 
Let N be the set of all integers n such that 0 < n < aya2 — a, — ap 
and n cannot be written in the form n = a,2, + a2%2, where x, and 
Zq are nonnegative integers. Then || = N(a,, a2) and |M|+|N| = 
(a1 — 1)(az — 1). Let n € [0, aya2 — a — ag], and write n in the form 


N= 41%, +a9%, where0 <2, <ag-1. 
This representation is unique. Define the function f by 
f(n) = ayag — ay — ag —N = a; (ag —1 — 21) — ag(xq +1). 


Prove that f is an involution that maps M onto N and N onto M, 


and so 
-1 -1 
IM) = |W) = =e) 
and 
N (a1, @2) = 1 
G(a1, a2) a2 


. Find all solutions in nonnegative integers 21,22, and x3 of the linear 
diophantine equation 


6x21 + 10x%2 + 15x73 = 30. 
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11. 
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. Find all solutions in integers 21,22, and x3 of the system of linear 


diophantine equations 
321 + 5x%2q + 7x3 = 560, 
9x1 + 25x29 + 493 = 2920. 
Find all solutions of the Ramanujan-Nagell diophantine equation 
e+7=2” 
with x < 1000. 
Find all solutions of the Ljunggren diophantine equation 
a? —2y* =-1 
with x < 1000. 


When is the sum of a geometric progression equal to a power? Equiva- 
lently, what are the solutions of the exponential diophantine equation 


L+oter+---ta™=y" (1.6) 
in integers x,m, y,n greater than 2? Check that 
1843" 3 sat 
14747 +7) =20?, 


and 
1+18+18? = 7°. 


These are the only known solutions of (1.6). 


Notes 


I can hardly do better than go back to the Greeks. I will state 
and prove two of the famous theorems of Greek mathematics. 
They are ‘simple’ theorems, simple both in idea and in execu- 
tion, but there is no doubt at all about their being theorems of 
the highest class. Each is as fresh and significant as when it was 
discovered—two thousand years have not written a wrinkle on 
either of them. 


G. H. Hardy [51, p. 92] 
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Number theory is an ancient subject. The famous theorems to which Hardy 
refers are the theorems that there are infinitely many primes (Theorem 1.14) 
and that V2 is irrational (Exercise 22 in Section 1.4). These appear in Eu- 
clid’s Elements [61, Book IX, Proposition 20, and Book X, Proposition 9]. 
The Euclidean algorithm also appears in Euclid [61, Book VII, Proposi- 
tion 2]. For fragments of number theory in Babylonian mathematics, see 
Neugebauer [110] and van der Waerden [147]. 
There are many excellent introductions to elementary number theory. 
My favorite is Number Theory for Beginners by André Weil [152]. Two 
classic works are Hardy and Wright [60] and Landau [87]. Other inter- 
esting books are Davenport [22], Hua [68], Kumanduri and Romero [85] 
and Ireland and Rosen [72]. There are beautiful introductions to algebraic 
number theory by Borevich and Shafarevich [13], Hecke [63, 64], Lang [90], 
and Neukirch [111], and to analytic number theory by Apostol [3], Dav- 
enport [21], Rademacher [119], and Serre [131, 132]. An excellent survey 
volume is Manin and Panchishkin, Introduction to Number Theory [96]. 

The best history is Weil, Number Theory: An Approach through History. 
From Hammurapi to Legendre [153]. There is also Leonard Eugene Dick- 
son’s encyclopedic but unreadable three-volume History of the Theory of 
Numbers [25]. 

Guy’s Unsolved Problems in Number Theory [45] is a nice survey of un- 
usual problems and results in elementary number theory. 

For a refinement of Theorem 1.16, see Nathanson [101]. 

Lang’s Algebra [89] is the standard reference for the algebra used in this 
book. 

In October, 1999, only 38 Mersenne primes had been discovered. The list 
of these primes is as follows: 


27-1 21. 2-1 27-1 al 
9l7 _ 4 919 _ 4 931 _ 4 961 _ 4 989 _ 4 
9107 —] 9127 —] 9521 =i 9607 =f 91279 =| 
92203 =]; 92281 | 93217 aif! 94253 =| 94423 al 
99689 af 99941 —1 911213 =] 919937 =) 921701 —1 
923209 —] 944497 =," 986243 =) 9110503 —1 9132049 =]. 


9216091 4 9756839 __ 4 9859433 _ 4 91257787 _ 1 91398269 _ 1 
92976221 4 93021377 _1 96972593 _ 1. 


The largest prime known in October, 1999 was the Mersenne prime M¢972593. 
An Internet site devoted to Mersenne primes and related problems in num- 
ber theory is www.mersenne.org. 
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Congruences 


2.1 The Ring of Congruence Classes 


Let m be a positive integer. If a and 0 are integers such that a—6 is divisible 
by m, then we say that a and b are congruent modulo m, and write 


a=b (mod ™m). 


Integers a and b are called incongruent modulo m if they are not congruent 
modulo m. For example, —12 = 43 (mod 5) and —12 = 43 (mod 11), 
but —12 4 43 (mod 7). Every even integer is congruent to 0 modulo 2, 
and every odd integer is congruent to 1 modulo 2. If x is not divisible by 
3, then x? =1 (mod 3). 

Congruence modulo m is an equivalence relation, since for all integers 
a,b, and c we have 


(i) Reflexivity: a=a (mod m), 
(ii) Symmetry: Ifa=b (mod m), thenb=a (mod ™m), and 


(iii) Transitivity: If @ = b (mod m) and b=c (modmy), then a = c 
(mod m). 


Properties (i) and (ii) follow immediately from the definition of congruence. 
To prove (iii), we observe that if a = 6b (mod m) and b=c (mod ™m), 
then there exist integers x and y such that a — b = ma and b—c = my. 
Since 


a—c=(a—b)+(b-—c) =mr4+my=m(r+y), 
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it follows thata=c (mod m). The equivalence class of an integer a under 
this relation is called the congruence class of a modulo m, and written 
a+mZ. Thus, a+mZ is the set of all integers b such that b=a (mod m), 
that is, the set of all integers of the form a+ ma for some integer x. If 
(a+mZ)N (b+mZ) #0, then a+ mZ = b+mZ. We denote by Z/mZ 
the set of all congruence classes modulo m. 

A congruence class modulo m is also called a residue class modulo m. 

By the division algorithm, we can write every integer a in the form 
a= mq+r, where qg and r are integers and0 <r<m-1.Thena=r 
(mod m), and r is called the least nonnegative residue of a modulo m. 

Ifa@=0 (mod m) and jal < m, then a = 0, since 0 is the only integral 
multiple of m in the open interval (—m,m). This implies that if a = b 
(mod m) and |a—b| < m, then a = b. In particular, ifr1,r2 € {0,1,...,m— 
1} and ifa =r, (mod m) and a=rp (mod m), then r; = rg. Thus, 
every integer belongs to a unique congruence class of the form r + mZ, 
where 0 <r < m-—1, and so 


Z/mZ = {mZ,1+mZ,...,(m—1)+mZ}. 


The integers 0,1,...,— 1 are pairwise incongruent modulo m. 
A set of integers R = {ri,...,1Tm} is called a complete set of residues 
modulo m if rj,...,7m are pairwise incongruent modulo m and every in- 


teger x is congruent modulo m to some integer r; € R. For example, the 
set {0,2,4,6,8,10,12} is a complete set of residues modulo 7. The set 
{0,3,6,9,12,15,18,21} is a complete set of residues modulo 8. The set 
{0,1,2,...,m— 1} is a complete set of residues modulo m for every posi- 
tive integer m. 

There is a natural way to define addition, subtraction, and multiplication 
of congruence classes. If 


a, =a, (mod m) 


and 
bi = bz (mod m), 
then 
a, +b) =ao4+ be (mod m), 
ay — by = ag— bo (mod m), 
and 


a,b, = Agb (mod m). 


These statements are consequences of the identities 


(ay + b;) aa (a2 + bz) = (ay a2) t (by bz) =0 (mod m), 


(ay A bi) al (ag rz bz) = (ay az) (b1 bz) =0 (mod m) 
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and 


a,b, — agb2 = ay (by be) t (ay az)be =0 (mod m). 


Addition, subtraction, and multiplication in Z/mZ are well-defined if we 
define the sum, difference, and product of congruence classes modulo m by 


(a+ mZ) + (b+ mZ) = (a+b) + mZ, 


(a+mZ) — (b+ mZ) = (a—b) + mZ, 


and 
(a+mZ) - (b+ mZ) = ab+ mZ. 


Addition of congruence classes is associative and commutative, since 


((a+mZ) + (b+ mZ)) + (c+ mZ) 
((a +b) + mZ) + (c+ mZ) 
((a+b)+c)+mZ 

= (a+(b+c))+mZ 
(a+ mZ) + ((b+ c) + mZ) 
( b4 


= (a+mZ) + ((b+mZ) + (c+ mZ)) 
and 
(a+mZ)+(b+mZ) = (a+b)4+mZ 
= (b+a)+mZ 


= (b+mZ)+(a+mZ). 


The congruence class mZ is a zero element for addition, since mZ + (a+ 
mZ) =a+mZ for alla+mZ € Z/mZ, and the additive inverse of the 
congruence class a+ mZ is —a+mZ, since 


(a+mZ) + (-a+mZ) =(a-—a)+mZ=m2Z. 


From these identities we see that the set of congruence classes modulo m 
is an abelian group under addition. 

We have also defined multiplication in Z/mZ. Multiplication is associa- 
tive and commutative, since 


((a+ mZ)(b+mZ))(c+mZ) = (ab)c+mZ 
= a(bc)+mZ 
= (a+mZ)((b+ mZ)(c+mZ)) 
and 


(a+ mZ)(b+ mZ) = ab+ mZ = ba+ mZ = (b+ mZ)(a+mZ). 
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The congruence class 1 + mZ is an identity for multiplication, since 
(1+ mZ)(a+mZ)=a+mZ 


for all a+ mZ € Z/mZ. Finally, multiplication of congruence classes is 
distributive with respect to addition in the sense that 


(a+mZ)((b+ mZ) + (c+mZ)) 
a(b +c) +mZ) 
= (ab+ mZ) + (ac+mZ) 
(a+mZ)(b+ mZ) + (a+ mZ)(e+ mZ) 


I 


for alla+mZ,b+mZ,c+mZ € Z/mZ. 

A ring is a set R with two binary operations, addition and multiplication, 
such that R is an abelian group under addition with additive identity 0, 
and multiplication satisfies the following axioms: 


(i) Associativity: For all x,y,z € R, 
(xy)z = x(yz). 


(ii) Identity element: There exists an element 1 € R such that for all 
re R, 


The element 1 is called the multiplicative identity of the ring. 
(iii) Distributivity: For all 2, y,z € R, 


x(yt+z)=xryt az. 


The ring R is commutative if multiplication also satisfies the axiom 


(iv) Commutativity: For all 7, y € R, 
ry = ye. 


The integers, rational numbers, real numbers, and complex numbers are 
examples of commutative rings. The set M2(C) of 2 x 2 matrices with 
complex coefficients and the usual matrix addition and multiplication is a 
noncommutative ring. 

Let R and S be rings with multiplicative identities 1p and 1s, respec- 
tively. A map f : R — S is called a ring homomorphism if f(a + y) = 
f(z) + f(y) and f(xy) = f(x) f(y) for all x,y € R, and f(1r) = 1s. 

An element a in the ring R is called a unit if there exists an element 
x € R such that az = 2a =1. Ifaisa unit in Rand « € Rand ye Rare 
both inverses of a, then x = x(ay) = (xa)y = y, and so the inverse of a is 
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unique. We denote the inverse of a by a~!. The set R* of all units in R is 
a multiplicative group, called the group of units in the ring R. A field is a 
commutative ring in which every nonzero element is a unit. For example, 
the rational, real, and complex numbers are fields. The integers form a ring 
but not a field, and the only units in the ring of integers are +1. 

The various properties of sums and products of congruence classes that 
we proved in this section are equivalent to the following statement. 


Theorem 2.1 For every integer m > 2, the set Z/mZ of congruence 
classes modulo m is a commutative ring. 


Exercises 
1. Compute the least nonnegative residue of 10* + 1 modulo 13 for k = 
1,2,3,4. 
2. Compute the least nonnegative residue of 57? modulo 23. 
3. Construct the multiplication table for the ring Z/5Z. 
4. Construct the multiplication table for the ring Z/6Z. 
5. Prove that every integer is congruent modulo 9 to one of the even 
integers 0,2,4,6,...,16. 
6. Let m be an odd positive integer. Prove that every integer is congru- 
ent modulo m to one of the even integers 0,2,4,6,...,2m — 2. 
7. Prove that every integer is congruent modulo 9 to a unique integer r 
such that —4<r< 4. 
8. Let m = 2q¢+ 1 be an odd positive integer. Prove that every integer 
is congruent modulo m to a unique integer r such that —q <r <q. 
9. Let m = 2q be an even positive integer. Prove that every integer is 
congruent modulo m to a unique integer r such that —(q—1) <r<q. 
10. Prove that a? =a (mod 6) for every integer a. 
11. Prove that at =1 (mod 5) for every integer a that is not divisible 
by 5. 
12. Prove that if a is an odd integer, then a2 =1 (mod 8). 
13. Let d be a positive integer that is a common divisor of a,b, and m. 


Prove that 
a=b (mod m) 


if and only if 
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14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Prove that if x,y, z are integers such that «2? + y? = z?, then ryz = 0 
(mod 60). 


Prove that a1 = a2 (mod m) implies af = ak (mod m) for all 


k > 1. Prove that if f(x) is a polynomial with integer coefficients and 
a1 =a2 (mod m), then f(ai) = f(a2) (mod m). 


(A criterion for divisibility by 9.) Prove that a positive integer n is 
divisible by 9 if and only if the sum of its decimal digits is divisible by 
9. (For example, the sum of the decimal digits of 567 is5+6+7 = 18.) 


Hint: Prove that 10" =1 (mod 9) for every nonnegative integer k. 


(A criterion for divisibility by 11.) Prove that a positive integer n is 
divisible by 11 if and only if the alternating sum of its decimal digits 
is divisible by 11. (For example, the alternating sum of the decimal 
digits of 80,729 is -9+ 2-7+0-—8=-—22.) 


Hint: Prove that 10 = (—1)* (mod 11) for every nonnegative inte- 
ger k. 


Prove that if 71,...,@m is a sequence of m not necessarily distinct 
integers, then there is a subsequence of consecutive terms whose sum 
is divisible by m, that is, there exist integers 1 < k < €< m such 


that 
£ 


PE =0 (mod m). 


i=k 


Hint: Consider the m+ 1 integers 0,271,471 +%2,%1 +%2+9%3,...,%1+ 
ty t+ +2m. 


Let m > 2 and let d be a positive divisor of m — 1. Let n = ap + 
aym+---+a,m* be the m-adic representation of n. Prove that n =0 
(mod d) if and only if a9 +a@1+---+a, =0 (mod d). 


Let n be a positive integer such that n = 3 (mod 4). Prove that n 
cannot be written as the sum of two squares. 


Prove that every integer belongs to at least one of the following 6 
congruence classes: 


WNwWworRono 
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22. Let p be prime, m > 1, and 0 <k < p—1. Prove that 
k 
N= ay ) =m _ (mod p). 
Pp 
Hint: Consider the integer (p — 1)!N modulo p. 


23. Let G be the subset of M2(C) consisting of the four matrices 


(ot )(2 o)-(% +)-C-4 9) 


Prove that G is a multiplicative group isomorphic to the additive 
group of congruence classes Z/4Z. 


2.2 Linear Congruences 
The following theorem is one of the most useful and important tools in 
elementary number theory. 


Theorem 2.2 Let m,a,b be integers with m > 1. Let d = (a,m) be the 
greatest common divisor of a and m. The congruence 


ax =b (mod m) (2.1) 
has a solution if and only if 
b=0 (mod d). 


Ifb=0 (mod d), then the congruence (2.1) has exactly d solutions in in- 
tegers that are pairwise incongruent modulo m. In particular, if (a,m) = 1, 
then for every integer b the congruence (2.1) has a unique solution modulo 
mM. 


Proof. Let d = (a,m). Congruence (2.1) has a solution if and only if 
there exist integers x and y such that 


ax —b=my, 


or, equivalently, 
b=ax— my. 
By Theorem 1.15, this is possible if and only ifb=0 (mod d). 
If « and x, are solutions of (2.1), then 


a(a1 — 2) =ax, —axv =b—b=0 (mod m), 


and so 
a(a, — x) = mz 
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for some integer z. If d is the greatest common divisor of a and m, then 


(a/d,m/d) = 1 and , ‘ 
(gq) @-#) = (q) # 


By Euclid’s lemma (Theorem 1.7), m/d divides x, — x, and so 


am 
ee ae 


for some integer 7, that is, 


m 

x,=2 (mod 7 ). 
Moreover, every integer x; of this form is a solution of (2.1). An integer x7; 
congruent to x modulo m/d is congruent to « + im/d modulo m for some 
integer i = 0,1,...d—1, and the d integers «+im/d with i = 0,1,...,d—1 
are pairwise incongruent modulo m. Thus, the congruence (2.1) has exactly 
d pairwise incongruent solutions. This completes the proof. 


Theorem 2.3 If p is a prime, then Z/pZ is a field. 


Proof. If a+ pZ € Z/pZ and a+ pZ # pZ, then a is an integer not 
divisible by p. By Theorem 2.2, there exists an integer x such that ax = 1 
(mod p). This implies that 


(a+ pZ)(a + pZ) =1+ pZ, 


and so a+ pZ is invertible. Thus, every nonzero congruence class in Z/pZ 
is a unit and Z/pZ is a field. 


Here are some examples of linear congruences. The congruence 
7x =3 (mod 5) 


has a unique solution modulo 5 since (7,5) = 1. The solution is 7 = 4 
(mod 5). The congruence 


35a =-14 (mod 91) (2.2) 
is solvable since (35,91) = 7 and 
—14=0 (mod 7). 
Congruence (2.2) is equivalent to the congruence 


5x =-2 (mod 13), (2.3) 
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which has the unique solution x = 10 (mod 13). Every solution of (2.2) 
satisfies 
x=10 (mod 13) 


and so a complete set of solutions that are pairwise incongruent modulo 91 
is {10, 23, 36, 49, 62, 75, 88}. 


Lemma 2.1 Let p be a prime number. Then x? = (mod p) if and only 
if2«=+1 (mod p). 


Proof. If =+1 (mod p), then x? =1 (mod p). Conversely, if x? = 1 
(mod p), then p divides x? — 1 = (a —1)(a+1), and so p must divide x —1 
orz+1. 


Theorem 2.4 (Wilson) [fp is prime, then 


(p —1)!=-1 (mod p). 


Proof. This is true for p = 2 and p = 3, since 1! = —1 (mod 2) 
and 2! = —-1 (mod 3). Let p > 5. By Theorem 2.2, to each integer a € 
{1,2,...,p—1} there is a unique integer a“! € {1,2,...,p —1} such that 
aa t= (mod p). By Lemma 2.1, a = a7! if and only ifa=1ora= 


p—1. Therefore, we can partition the p—3 numbers in the set {2,3,...,p—2} 
into (p — 3)/2 pairs of integers {a;,a;'} such that aja;'=1 (mod p) for 
i=1,...,(p—3)/2. Then 


(pat = Te 2s tp 2\(p 1) 
(p—3)/2 
= (p-1l) II ajas” 
= p-l 7 
= -1 (mod p). 


This completes the proof. 


For example, 
4!=24=-—1 (mod 5) 


and 
6! = 720 =—1 (mod 7). 
The converse of Wilson’s theorem is also true (Exercise 7). 


Theorem 2.5 Let m and d be positive integers such that d divides m. If a 
is an integer relatively prime to d, then there exists an integer a’ such that 
a’ =a _ (mod d) and a’ is relatively prime to m. 
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Proof. Let m = inven pi andd = Th p;', where r; > land0O < 5; <r; 


a a 


fori=1,...,k. Let m’ be the product of the prime powers that divide m 
but not d. Then 


k 
yan) 
and 
(a) = 1h 


By Theorem 2.2, there exists an integer x such that 


dx=1—a (mod m’). 


Then 
a’ =a+dz=1 (mod m’) 
and so 
(a',m') =1 
Also, 


a’ =a _ (mod d). 


If (a’,m) # 1, there exists a prime p that divides both a’ and m. However, 
p does not divide m’ since (a’,m’) = 1. It follows that p divides d, and 
so p divides a’ — dx = a, which is impossible since (a,d) = 1. Therefore, 
(a’,m) = 1. 


Ifa@=b (mod ™m), then a= b+ mz for some integer x. An integer d is 
a common divisor of a and m if and only if d is a common divisor of b and 
m, and so (a,m) = (b,m). In particular, if a is relatively prime to m, then 
every integer in the congruence class of a+ mZ is relatively prime to m. 
A congruence class modulo m is called relatively prime to m if some (and, 
consequently, every) integer in the class is relatively prime to m. 

We denote by y(m) the number of congruence classes in Z/mZ that are 
relatively prime to m. The function y(m) is called the Euler phi function. 
Equivalently, y(m) is the number of integers in the set 0,1,2,...,m—1 
that are relatively prime to m. The Euler phi function is also called the 
totient function. 

A set of integers {r1,...,7y(m)} is called a reduced set of residues modulo 
m if every integer x such that (2,m) = 1 is congruent modulo m to some 
integer r;. For example, the sets {1,2,3,4,5,6} and {2,4,6,8,10,12} are 
reduced sets of residues modulo 7. The sets {1,3,5, 7} and {3,9, 15,21} are 
reduced sets of residues modulo 8. 

An integer a is called invertible modulo m or a unit modulo m if there 
exists an integer x such that 


ax =1 (mod m). 
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By Theorem 2.2, a is invertible modulo m if and only if a is relatively prime 
to m. Moreover, if a is invertible and az =1 (mod m), then z is unique 
modulo m. The congruence class a+ mZ is called invertible if there exists 
a congruence class « + mZ such that 


(a+mZ)(a+mZ) =14+ mZ. 


We denote the inverse of the congruence class a+ mZ by (a+ mZ)~! = 
a~'+mZ. The invertible congruence classes are the units in the ring Z/mZ. 
We denote the group of units in Z/mZ by 


(Z/mZ)” . 
If R= {ri,..-,1%g(m)} is a reduced set of residues modulo m, then 
(Z/mZ)* ={r+mZ:r € R} 


and 


|(Z/mZ)*| = (mn). 


For example, 
(Z/6Z)* = {14+ 6Z,5+6Z} 


and 


(Z/7Z)* = {14+ 7Z,24+7Z,34+7Z,4+7Z,5+7Z,6+7Z}. 


If a+ mZ is a unit in Z/mZ, then (a,m) = 1 and we can apply the 
Euclidean algorithm to compute (a+ mZ)~!. If we can find integers x and 
y such that 

ax+my=1, 
then 
(a+mZ)(a+mZ) =14+ mZ, 


and x+mZ=(a+mZ)"!. 
For example, to find the inverse of 13+ 17Z, we use the Euclidean algo- 
rithm to obtain 


17 = 13-144, 
13 = 4-341, 
4 = 1-4. 
This gives 
1=13-—4-3=13-(17-13-1)3 =13-4-17-3, 
and so 
13-4=1 (mod 17). 
Therefore, 


(18 +17Z)"' =4+4 172. 
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Exercises 

1. Find all solutions of the congruence 44 =9 (mod 11). 

2. Find all solutions of the congruence 12% = 3 (mod 45). 

3. Find all solutions of the congruence 28% = 35 (mod 42). 

4. Find all solutions of the system of congruences 

5a+7y=3 (mod 17) 
2x +3y=-2 (mod 17). 
5. Find all solutions of the system of congruences 
8x+5y=1 (mod 13) 

4x + 3y=3 (mod 13). 

6. Find the inverse of each nonzero congruence class modulo 13. 

7. Prove that if m is composite and m 4 4, then (m—1)!=0 (mod m). 
This is the converse of Wilson’s theorem. 

8. Prove that if p > 5 is an odd prime, then 

6(p—4)!=1 (mod p). 

9. Let m and a be integers such that m > 1 and (a,m) = 1. Prove 
that if {r1,...,Ty(m)} is a reduced set of residues modulo m, then 
{@r1,..-,47y(m)} is also a reduced set of residues modulo m. 

10. We say that an integer a is nilpotent modulo m if there exists a pos- 
itive integer k such that a* =0 (mod m). Prove that a is nilpotent 
modulo m if and only if@=0 (mod rad(m)). 

11. For n > 1, consider the rational number 


where u,, and Uy are positive integers. Prove that if p is an odd prime, 
then the numerator up—1 of hp_1 is divisible by p. 


Hint: Write hp, as a fraction with denominator (p — 1)!, and apply 
Wilson’s theorem. 
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12. (A criterion for divisibility by 7.) Let n be a positive integer, and 
let dydy_1...d do be the usual 10-adic representation of n. Define 
f(n) = dgdp_1...d1 — 2do. (For example, if n = 203, then do = 3, 
d, =0, dy = 2, and f(203) = 20—6 = 14.) Prove that n is divisible by 
7 if and only if f(n) is divisible by 7. Use this criterion to determine 
if 7875 is divisible by 7. 

Hint: Prove that 10u + u=0 (mod 7) if and only if vu — 2u = 0 
(mod 7). 


13. Let k > 3. Find all solutions of the congruence 


a? =1 (mod 2*), 


2.3. The Euler Phi Function 


An arithmetic function is a function defined on the positive integers. The 
Euler phi function y(m) is the arithmetic function that counts the number 


of integers in the set 0,1,2,...,m-— 1 that are relatively prime to m. We 
have 

g(1) = 1, y(6) = 2, 

y(2) = 2, y(7) = 6, 

9(3) = 3, y(8) = 4, 

y(4) = 2, (9) = 6, 

9(5) = 4, P10) w= od 


If p is a prime number, then (a, p) = 1 fora =1,...,p—1, and y(p) = p—1. 
If p" is a prime power and 0 < a < p” —1, then (a,p”) > 1 if and only if a 
is a multiple of p. The integral multiples of p in the interval [0,p" — 1] are 
the p"~+ numbers 0, p, 2p, 3p, ...,(p"~' — 1)p, and so 

r Tr r—-l1 Tr 1 

p(p") =p’ —p™ * =p" (1——}. 
Pp 

In this section we shall obtain some important properties of the Euler phi 
function. 


Theorem 2.6 Letm and n be relatively prime positive integers. For every 
integer c there exist unique integers a and b such that 


O0O<a<n-l, 
0<b<m-l, 


and 
c=ma+nb (mod mn). (2.4) 


Moreover, (c,mn) = 1 if and only if (a,n) = (b,m) = 1 in the representa- 


tion (2.4). 
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Proof. If a1, a2, b1, b2 are integers such that 
ma; +nby = maz + nbz (mod mn), 
then 
ma, = ma, + nby = mag +nbg = maz (mod n). 


Since (m,n) = 1, it follows that 
a; =a (mod n), 


and so a, = ag. Similarly, b, = bg. It follows that the mn integers ma+ nb 
are pairwise incongruent modulo mn. Since there are exactly mn distinct 
congruence classes modulo mn, the congruence (2.4) has a unique solution 
for every integer c. 

Let c=ma+nb (mod mn). Since (m,n) = 1, we have 


(c,m) = (ma + nb,m) = (nb,m) = (b,m) 


and 
(c,n) = (mat nb, n) = (ma,n) = (a,n). 

It follows that (c,mn) = 1 if and only if (c,m) = (c,n) = 1 if and only if 

(b,m) = (a,n) = 1. This completes the proof. 0 


For example, we can represent the congruence classes modulo 6 as linear 
combinations of 2 and 3 as follows: 


0 = 0-24+0-3 (mod 6), 
1 = 2-241-3 (mod 6), 
2 = 1-2+0-3 (mod 6), 
3 = 0-2+1-3 (mod 6), 
4 = 2-24+0-3 (mod 6), 
5 = 1-2+1-3 (mod 6). 


A multiplicative function is an arithmetic function f(m) such that f(mn) = 
f(m)f(n) for all pairs of relatively prime positive integers m and n. If 
f(m) is multiplicative, then it is easy to prove by induction on & that if 
my,..., Mz are pairwise relatively prime positive integers, then f(m1---mz) = 


fm) +> fm). 


Theorem 2.7 The Euler phi function is multiplicative. Moreover, 


vf (-3) 
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Proof. Let (m,n) = 1. There are y(mn) congruence classes in the ring 
Z/mnZ that are relatively prime to mn. By Theorem 2.6, every congruence 
class modulo mn can be written uniquely in the form ma+ nb + mnZ, 
where a and 6 are integers such that 0 <a<n-—landO<b<m-l1. 
Moreover, the congruence class ma+nb+mnZ is prime to mn if and only 
if (b,m) = (a,n) = 1. Since there are y(n) integers a € [0,n — 1] that are 
relatively prime to n, and y(m) integers b € [0,m — 1] relatively prime 
to m, it follows that y(mn) = y(m)y(n), and so the Euler phi function is 


multiplicative. If m1,,...,m, are pairwise relatively prime positive integers, 
then y(m,---m%) = y(m1)---p(m,). In particular, if m = pj'---p,* is 
the standard factorization of m, where p,,...,px are distinct primes and 
T1,-.-,;Tk are positive integers, then 
k k 1 1 
e(m) = |] 9 (') = [pF (1 7 ) =m]]| (1 = ) 
i=l i=l Pi P 
= = p|m 


This completes the proof. 


For example, 7875 = 325°7 and 


(7875) = v(37)y(53)y(7) = (9 — 3)(125 — 25)(7 — 1) = 3600. 
Theorem 2.8 For every positive integer m, 
> (d) =m. 
d|m 


Proof. We first consider the case where m = p* is a power of a prime p. 
The divisors of p* are 1, p,p?,...,p", and 


t t 


S>9(@) =S5 op") =14+ 90 (ph -p") =. 


d|pt r=0 r=1 
Next we consider the general case where m has the standard factorization 
m = pip? + pie, 
where p1,..., px are distinct prime numbers and ¢,...,t, are positive in- 
tegers. Every divisor d of m is of the form 
d= py py + Dy 


where 0 <r; < t; fori =1,...,k. By Theorem 2.7, y(d) is multiplicative, 
and so 
9(d) = 9(p;")p(p2) ++ p(Pi)- 
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Therefore, 
ty tr 
Sood) = So-- Sd) opp ++ pe) 
d|m r14=0 rp=0 
ti k 
= So--- SS vt) e@?)--- o@%e) 
r1=0 Th=O0 
k ti 
= y (p;") 
i=17r;=0 


I 
ol 
a 


i=1 
= m. 
This completes the proof. 
For example, 
S>o(d) = 9(1) + (2) + (3) + 9(4) + ¥(6) + (12) 


dj12 


= 12 
and 
Sood) = v(1) + (3) + 95) + v9) + v(15) + (45) 
a = 142444648424 
= 45. 
Exercises 


1. Compute y(6993). 


2. Represent the congruence classes modulo 12 in the form 3a+ 4b with 
O0O<a<3and0<b<2. 


3. Let m = 15. Compute y(d) for every divisor d of m, and check that 
pedi y(d) = m. Repeat this exercise for m = 16,17, and 18. 


4. Prove that y(m) is even for all m > 3. 


5. Prove that y(m*) = m*~!y(m) for all positive integers m and k. 
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6. Prove that m is prime if and only if y(m) = m-—1. 

7. Prove that y(m) = y(2m) if and only if m is odd. 

8. Prove that if m divides n, then y(m) divides y(n). 

9. Find all positive integers n such that y(n) is not divisible by 4. 
10. Find all positive integers n such that y(5n) = 5y(n). 


11. Let f(n) = y(n)/n. Prove that y(p*) = y(p) for all primes p and all 
positive integers k. 


12. This problem gives an alternative proof of Theorem 2.8. Let m > 1, 
and let S be the set of fractions k/m with k = 0,1,...,m-—1. Write 
each fraction in lowest terms: k/m = a/d, where d is a divisor of m 
and (a, d) = 1. For example, 0/m = 0/1. Show that for each divisor d 
of m there are exactly y(d) fractions k/m € S' that have denominator 
d when reduced to lowest terms. Deduce that }7q),, 9(d) =m. 


13. Let N,,(a) denote the number of positive integers not exceeding « 
that are relatively prime to m. Prove that 


sam Nm(2) _ elm) 
LOCO x m 


This result can be expressed as follows: The probability that a random 
integer is prime to m is y(m)/m. 


2.4 Chinese Remainder Theorem 


Theorem 2.9 Let m and n be positive integers. For any integers a and b 
there exists an integer x such that 


x=a (mod m) (2.5) 


and 
x=b (mod n) (2.6) 


if and only if 
a=b (mod (m,n)). 


If x is a solution of congruences (2.5) and (2.6), then the integer y is also 
a solution if and only if 


x=y (mod [m,n)). 


62 2. Congruences 


Proof. If x is a solution of congruence (2.5), then = a+ mu for some 
integer u. If x is also a solution of congruence (2.6), then 


cr=atmu=b (mod n), 
that is, 
a+mu=b+nv 


for some integer v. It follows that 
a—b=nv—mu=0 (mod (m,n)). 


Conversely, ifa—b=0 (mod (m,n)), then by Theorem 1.15 there exist 
integers u and v such that 


a—b=nv-—mu. 
Then 
r=a+tmu=b+nvu 


is a solution of the two congruences. 
An integer y is another solution of the congruences if and only if 


y=a=x (mod m) 
and 
y=b=xz (mod n), 


that is, if and only if s—y is acommon multiple of m and n, or, equivalently, 
x — y is divisible by the least common multiple [m,n]. This completes the 
proof. 


For example, the system of congruences 
= 5 (mod 21), 
= 19 (mod 56), 


has a solution, since 
(56,21) = 7 


and 
19=5 (mod 7). 


The integer x is a solution if there exists an integer u such that 
x=5+2lu=19 (mod 56), 


that is, 
2iu=14 (mod 56), 
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3u=2 (mod 8), 
or 
u=6 (mod 8). 
Then 
x =5+2lu=5+21(6 + 8v) = 131 + 168u 


is a solution of the system of congruences for any integer v, and so the set 
of all solutions is the congruence class 131 + 168Z. 


Theorem 2.10 (Chinese remainder theorem) Let k > 2. Ifay,...,az 
are integers and m,,..., Mx are pairwise relatively prime positive integers, 
then there exists an integer x such that 


x=a; (modm,) for alli=1,...,k. 


If x is any solution of this set of congruences, then the integer y is also a 
solution if and only if 


x=y (mod my,---m«x). 


Proof. We prove the theorem by induction on k. If k = 2, then [m,, m2] = 
mM, and this is a special case of Theorem 2.9. 

Let k > 3, and assume that the statement is true for & — 1 congruences. 
Then there exists an integer z such that z =a; (mod m,) fori =1,...,k- 
1. Since m1,..., mx are pairwise relatively prime integers, we have 


(m1 -++Mg-1, Me) = 1, 
and so, by the case k = 2, there exists an integer x such that 


= z (mod m,---mg-1), 
= az (mod mz). 
Then 
L=z=a; (mod m) 
fori=1,...,k-—1. 

If y is another solution of the system of & congruences, then x — y is 
divisible by m, for alli =1,...,k. Since mj1,..., mx are pairwise relatively 
prime, it follows that x — y is divisible by m,---m,. This completes the 
proof. 


For example, the system of congruences 


x = 2 (mod 83), 
x = 3 (mod5), 
x = 5 (mod 7), 
x = 7 (mod 11) 
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has a solution, since the moduli are pairwise relatively prime. The solution 
to the first two congruences is the congruence class 


x=8 (mod 15). 
The solution to the first three congruences is the congruence class 
x =68 (mod 105). 
The solution to the four congruences is the congruence class 
x =1118 (mod 1155). 


There is an important application of the Chinese remainder theorem to 
the problem of solving diophantine equations of the form 


f(x1,..-,%%) =0 (mod m), 


where f(#1,...,2,) is a polynomial with integer coefficients in one or sev- 
eral variables. This equation is solvable modulo m if there exist integers 
Q1,...,@% Such that 


f(ai1,...,@%) =O (mod m). 


The Chinese remainder theorem allows us to reduce the question of the 
solvability of this congruence modulo m to the special case of prime power 
moduli p”. For simplicity, we consider polynomials in only one variable. 
Theorem 2.11 Let 
Ty Tk 

m — Py eee Pr 
be the standard factorization of the positive integer m. Let f(a) be a poly- 
nomial with integral coefficients. The congruence 


f(x) =0 (mod m) 
is solvable if and only if the congruences 

f(x) =0 (mod ph) 
are solvable for alli =1,...,k. 


Proof. If f(z) =0 (mod m) has a solution in integers, then there exists 
an integer a such that m divides f(a). Since p;* divides m, it follows that 
p;' divides f(a), and so the congruences f(z) =0 (mod pj‘) are solvable 
fori =1,...,k. 

Conversely, suppose that the congruences f(z) =0 (mod p;‘) are solv- 
able for i= 1,...,k. Then for each 7 there exists an integer a; such that 


f(ai)=0 (mod p;*). 
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Since the prime powers p{’,...,p;" are pairwise relatively prime, the Chi- 
nese remainder theorem tells us that there exists an integer a such that 


a=a; (mod p;*) 
for all 7. Then 
f(a) = f(a) =0 (mod p;') 
for all i. Since f(a) is divisible by each of the prime powers p;", it is also 


divisible by their product m, and so f(a) =0 (mod m). This completes 
the proof. 


For example, consider the congruence 
f(x) =2?—34=0 (mod 495). 
Since 495 = 37-5- 11, it suffices to solve the congruences 


f(a) =2? —34=27+2=0 (mod 9), 


f(z) =" —34=27+1=0 (mod 5), 


and 
f(t) =2? -34=27-1=0 (mod 11). 


These congruences have solutions 


and 
f(1)=0 (mod 11). 


By the Chinese remainder theorem, there exists an integer a such that 


a = 5 (mod 9), 
= 2 (mod 5), 
a = 1 (mod 11) 


Solving these congruences, we obtain 
a=122 (mod 495). 
We can check that 
f(122) = 1227 — 34 = 14,850 = 30- 495, 


and so 
f(122) =0 (mod 495). 
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Exercises 


1. Find all solutions of the system of congruences 


= 4 (mod 5), 
5 (mod 6). 


. Find all solutions of the system of congruences 


(mod 12), 


5 
8 (mod 9). 


. Find all solutions of the system of congruences 


(mod 12), 
(mod 10). 


. Find all solutions of the system of congruences 


(mod 5), 


20. S11 
3x =4 (mod 7). 


. Find all integers that have a remainder of 1 when divided by 3, 5, 


and 7. 


. Find all integers that have a remainder of 2 when divided by 4 and 


that have a remainder of 3 when divided by 5. 


. Find all solutions of the congruence 


f(z) =5z? —93=0 (mod 231). 


. (Bhaskara, sixth century) A basket contains n eggs. If the eggs are 


removed 2,3, 4,5, or 6 at a time, then the number of eggs that remain 
in the basket is 1,2,3,4, or 5, respectively. If the eggs are removed 
7 at a time, then no eggs remain. What is the smallest number n of 
eggs that could have been in the basket at the start of this procedure? 


Hint: The first condition implies that n =1 (mod 2). 


. Let f be a polynomial with integer coefficients. For m > 1, let N¢(m) 


denote the number of pairwise incongruent solutions of f(a) = 0 
(mod m). Prove that the function Ny(m) is multiplicative, that is, 
N¢(mimza) = N;(m1).N¢ (ma) if (m1,m2) = 1, 
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10. Let m,..., mx be pairwise relatively prime positive integers and m = 
my1--++mMmp. Define the map 


f :(Z/mZ)* — (Z/m,Z)* x --- x (Z/m_Z)* 
by 
f(a+mZ) = (a+ mZ,...,a+m,Z). 


Use the Chinese remainder theorem to show directly that this map 
is one-to-one and onto. 


2.5  Euler’s Theorem and Fermat’s Theorem 


Euler’s theorem and its corollary, Fermat’s theorem, are fundamental re- 
sults in number theory, with many applications in mathematics and com- 
puter science. In the following sections we shall see how the Euler and 
Fermat theorems can be used to determine whether an integer is prime or 
composite, and how they are applied in cryptography. 


Theorem 2.12 (Euler) Let m be a positive integer, and let a be an inte- 
ger relatively prime to m. Then 


a?(™ = 1 (mod m). 


Proof. Let {r1,...,1(m)} be a reduced set of residues modulo m. Since 
(a,m) = 1, we have (ar;,m) = 1 for i = 1,...,y(m). Consequently, for 
every 7 € {1,...,y¢(m)} there exists o(7) € {1,...,~(m)} such that 


ar; =o) (mod m). 


Moreover, ar; = ar; (mod m) if and only if ¢ = j, and so a is a permuta- 
tion of the set {1,...,y(m)} and {arj,...,arg(m)} is also a reduced set of 
residues modulo m. It follows that 


al ripe + ‘Te(m) = (ar1)(ar2)---(argim)) (mod m) 
To(1)To(2)°**Ta(g(m)) (mod m) 


= 11T2°''Ty(m) (mod m). 


Dividing by rire---rg¢m), we obtain 


a? = 1 (mod m). 


This completes the proof. 


The following corollary is sometimes called Fermat’s little theorem. 
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Theorem 2.13 (Fermat) Let p be a prime number. If the integer a is not 
divisible by p, then 
a?-'=1 (mod p). 
Moreover, 
a? =a _ (mod p) 

for every integer a. 

Proof. If p is prime and does not divide a, then (a, p) = 1, y(p) = p—-1, 
and 

a?-+ — q?) =1 (mod p) 


by Euler’s theorem. Multiplying this congruence by a, we obtain 


Pp 


a? =a _ (mod p). 


If p divides a, then this congruence also holds for a. 


Let m be a positive integer and let a be an integer that is relatively 
prime to m. By Euler’s theorem, a?) = 1 (mod m). The order of a 
with respect to the modulus m is the smallest positive integer d such that 
at?=1 (mod m). Then 1 < d< y(m). We denote the order of a modulo 
m by ord,(a). We shall prove that ord,,(a) divides y(m) for every integer 
a relatively prime to p. 


Theorem 2.14 Let m be a positive integer and a an integer relatively 
prime to m. If d is the order of a modulo m, then a® = a’ (mod m) 
if and only ifk = (mod d). In particular, a” =1 (mod m) if and only 


if d divides n, and so d divides ~(m). 


Proof. Since a has order d modulo m, we have a4 = 1 (mod m). If 
k= (mod d), then k = + dg, and so 


a Sal = a (a?)? =a’ (mod m). 


Conversely, suppose that a* = a’ (mod m). By the division algorithm, 


there exist integers g and r such that 
k—-lL=dq+r and O0<r<d-1l. 


Then 


a So ee Sa (a?)? a” =a*a" (mod m). 


k 


Since (a*,m) = 1, we can divide this congruence by a* and obtain 


a’ =1 (mod m). 
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Since 0 < r < d—1, and d is the order of a modulo m, it follows that r = 0, 
and sok = (mod d). 
If a” = 1 =a (mod m), then d divides n. In particular, d divides 


y(m), since a®°™ =1 (mod m) by Euler’s theorem. 


For example, let m = 15 and a = 7. Since y(15) = 8, Euler’s theorem 
tells us that 
7 =1 (mod 15). 
Moreover, the order of 7 with respect to 15 is a divisor of 8. We can compute 
the order as follows: 


7’ = 7 (mod 15), 

7 = 49=4 (mod 15), 
7 = 28=13 (mod 15), 
7 = 91=1 (mod 15), 


and so the order of 7 is 4. 

We shall give a second proof of Euler’s theorem and its corollaries. We 
begin with some simple observations about groups. We define the order of 
a group as the cardinality of the group. 


Theorem 2.15 (Lagrange’s theorem) If G is a finite group and H is 
a subgroup of G, then the order of H divides the order of G. 


Proof. Let G be a group, written multiplicatively, and let X be a 
nonempty subset of G. For every a € G we define the set 


aX = {ar: x € X}. 


The map f : X — aX defined by f(x) = ax is a bijection, and so |X| = 
|aX| for alla € G. If H is a subgroup of G, then aH is called a coset 
of H. Let aH and bH be cosets of the subgroup H. If aH nN bH F J, 
then there exist x,y € H such that ax = by, or, since H is a subgroup, 
b= ary! = az, where z = xy~! € H. Then bh = azh € aH for allh € H, 
and so bH C aH. By symmetry, aH C bH, and so aH = bH. Therefore, 
cosets of a subgroup 4H are either disjoint or equal. Since every element 
of G belongs to some coset of H (for example, a € aH for all a € G), it 
follows that the cosets of H partition G. We denote the set of cosets by 
G/H. If G is a finite group, then H and G/H are finite, and 


|G| = |H||G/H|. 


In particular, we see that |H| divides |G]. 


Let G be a group, written multiplicatively, and let a € G. Let H = {a" : 
k € Z}. Then 1 = a® € H CG. Since a*a’ = a*** for all k, ¢ € Z, it follows 
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that H is a subgroup of G. This subgroup is called the cyclic subgroup 
generated by a, and written (a). Cyclic subgroups are abelian. 

The group G is cyclic if there exists an element a € G such that G = (a). 
In this case, the element a is called a generator of G. For example, the group 
(Z/7Z)* is a cyclic group of order 6 generated by 3+ 7Z. The congruence 
class 5+ 7Z is another generator of this group. 

If a* £ a for all integers k 4 , then the cyclic subgroup generated by 
a is infinite. If there exist integers k and @ such that k < @ and a® = a’, 
then a‘—* = 1. Let d be the smallest positive integer such that a? = 1. 
Then the group elements 1,a,a?,...,a7~! are distinct. Let n € Z. By the 
division algorithm, there exist integers gq and r such that n = dq+r and 
0<r<d-—1. Since 
n q@atr = (a®)* a" r 


a =a’, 


it follows that 
(a) = {a":n€ Z} = {a":0<r<d-I}, 


and the cyclic subgroup generated by a has order d. Moreover, a* = a® if 
and only ifk = (mod d). 

Let G be a group, and let a € G. We define the order of a as the cardi- 
nality of the cyclic subgroup generated by a. 


Theorem 2.16 Let G be a finite group, anda € G. Then the order of the 
element a divides the order of the group G. 


Proof. This follows immediately from Theorem 2.15, since the order of 
a is the order of the cyclic subgroup that a generates. 


Let us apply these remarks to the special case when G = (Z/mZ)* is 
the group of units in the ring of congruence classes modulo m. Then G is a 
finite group of order v(m). Let (a,m) = 1 and let d be the order of a+mZ 
in G, that is, the order of the cyclic subgroup generated by a + mZ. By 
Theorem 2.16, d divides y(m), and so 


(m)/ 


ak™ 4 m= (a+mZ)?™ = ((a+mZ)*)” Ws 1+mZ. 


Equivalently, 
a? =1 (mod m). 


This is Euler’s theorem. 


Theorem 2.17 Let G be a cyclic group of order m, and let H be a subgroup 
of G. If a is a generator of G, then there exists a unique divisor d of m 
such that H is the cyclic subgroup generated by a*, and H has order m/d. 
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Proof. Let S be the set of all integers u such that a” € H. If u,v € S, 
then a“,a” € H. Since H is a subgroup, it follows that a“a” = a“t’ € H 
and a“(a”)~1 = a“~” € H. Therefore, utv € S, and S is a subgroup of Z. 
By Theorem 1.3, there is a unique nonnegative integer d such that S = dZ, 
and so H is the cyclic subgroup generated by a%. Since a™ = 1 € H, we 
have m € S, and so d is a positive divisor of m. It follows that H has order 
m/d. 


Theorem 2.18 Let G be a cyclic group of order m, and let a be a generator 
of G. For every integer k, the cyclic subgroup generated by a® has order 
m/d, where d = (m,k), and (a*) = (a*). In particular, G has exactly y(m) 
generators. 


Proof. Since d = (k,m), there exist integers x and y such that d = 
ka + my. Then 


at = qhetmy = (a*)* (a™)¥ = (a*)* 


and so a? € (a*) and (a%) C (a*). Since d divides k, there exists an integer 
z such that k = dz. Then 

at = (a’)*, 
and so a® € (a®) and (a*) C (a“). Therefore, (a*) = (a“) and a* has 
order m/d. In particular, a* generates G if and only if d = 1 if and only 
if (m,k) = 1, and so G has exactly y(m) generators. This completes the 
proof. 


We can now give a group theoretic proof of Theorem 2.8. Let G be a 
cyclic group of order m. For every divisor d of m, the group G has a unique 
cyclic subgroup of order d, and this subgroup has exactly y(d) generators. 
Since every element of G generates a cyclic subgroup, it follows that 


m= S" ¢(d). 


d|m 


Voila! 


Exercises 


1. Prove that 
3°'2=1 (mod 1024). 


2. Find the remainder when 7°! is divided by 144. 
3. Find the remainder when 21°” is divided by 31. 
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10. 


11. 


12. 


13. 


14. 
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. Compute the order of 2 with respect to the prime moduli 3, 5, 7, 11, 


13, 17, and 19. 


. Compute the order of 10 with respect to the modulus 7. 


. Let r; denote the least nonnegative residue of 10‘ (mod 7). Compute 


r;, for i = 1,...,6. Compute the decimal expansion of the fraction 1/7 
without using a calculator. Can you find where the numbers r1,..., 16 
appear in the process of dividing 7 into 1? 


. Compute the order of 10 modulo 13. Compute the period of the frac- 


tion 1/13. 


. Let p be prime and a an integer not divisible by p. Prove that if 


a?" =—1 (mod p), then a has order 2”*! modulo p. 


. Let m be a positive integer not divisible by 2 or 5. Prove that the 


decimal expansion of the fraction 1/m is periodic with period equal 
to the order of 10 modulo m. 


Prove that the decimal expansion of 1/m is finite if and only if the 
prime divisors of m are 2 and 5. 


Prove that 10 has order 22 modulo 23. Deduce that the decimal ex- 
pansion of 1/23 has period 22. 


Prove that if p is a prime number congruent to 1 modulo 4, then there 
exists an integer x such that 2? =—1 (mod p). 


Hint: Observe that 


(p—1)/2 (p—1)/2 
(P-1)! = |] sm~-y= J] Cy) 
j=l j=l 
(p-1)/2 \? 
= (-1)@-vP II f (mod p), 
j=l 


and apply Theorem 2.4. 


Prove that if n > 2, then 2” — 1 is not divisible by n. 


Hint: Let p be the smallest prime that divides n. Consider the con- 
gruence 2" =1 (mod p). 


Prove that if p and q are distinct primes, then 


pr tig tet (mod pq). 


15. 


16. 


17. 


18. 


19. 
20. 


21s 


22. 
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Prove that if m and n are relatively prime positive integers, then 


ml 4 7? =1 (mod mn). 


Let p be an odd prime. By Euler’s theorem, if (a,p) = 1, then 


Prove that if (ab, p) = 1, then 
fp(ab) = f(a) + fp(b) (mod p). 


Let f(a) and g(x) be polynomials with integer coefficients. We say 
that f(x) is equivalent to g(a) modulo p if 


f(a) = g(a) (mod p) for all integers a. 


Prove that the polynomials x°+5x7 +3 and «?—22+ 24 are equivalent 
modulo 7. Prove that every polynomial is equivalent modulo p to a 
polynomial of degree at most p— 1. 


Hint: Use Fermat’s theorem. 


Let G be the group (Z/7Z)*. Determine all the cyclic subgroups of 
G. 


Prove that the group (Z/11Z)* is cyclic, and find a generator. 


Let G be a group with subgroup H. Define a relation ~ on G as 
follows: a ~ b if b-1a € H. Prove that this is an equivalence relation 
(that is, reflexive, symmetric, and transitive). Prove that a ~ b if and 
only if aH = bH, and so the equivalence classes of this relation are 
the cosets in G/H. 


Let G be an abelian group with subgroup H. Let G/H be the set of 
cosets of H in G. Define multiplication of congruence classes by 


aH -bH =abH. 


Prove that if aH = a'H and bH = 0'H, then abH = a’'b’H, and so 
multiplication of cosets is well-defined. Prove that G/H is an abelian 
group with this multiplication. This is called the quotient group of G 
by H. 


Let G be a group and let H and K be subgroups of G. For a € G, 
we define the double coset Hak = {hak:h € H,k € K}. Prove that 
ifa,b€ Gand Hak N HbK £9, then Hak = HbK. 
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Suppose we are given an odd integer n > 3, and we want to determine 
whether n is prime or composite. If n is “small,” we can simply divide n 
by all odd integers d such that 3 < d < \/n. If some d divides n, then n 
is composite; otherwise, n is prime. If n is “big,” however, this method is 
time-consuming and impractical. We need to find other primality tests. 

Fermat’s theorem can be applied to this problem. By Fermat’s theorem, 
if n is an odd prime, then 2"~'= 1 (mod n). Therefore, if n is odd and 
2"-'41 (mod n), then n must be composite. In general, we can choose 
any integer b that is relatively prime to n. By Fermat’s theorem, if n is 
prime, then b”~' = 1 (mod n). It follows that if b&°-' #1 (mod n), 
then n must be composite. Thus, for every base b, Fermat’s theorem gives 
a primality test, that is, a necessary condition for an integer n to be prime. 

Suppose we want to know whether n = 851 is prime or composite. We 
shall compute 2°°° (mod 851). An efficient method is to use the 2-adic 
representation of 850: 


850 = 24-27 4:98 498 4.99: 


482 
Since 2?” = (2? ‘) , we have 


2*=4 (mod 851), 

22 
2° =16 (mod 851), 
22° =256 (mod 851), 
2?°=9 (mod 851), 
2?” =81 (mod 851), 
22° = 604 (mod 851), 
22’ =588 (mod 851), 
22° = 238 (mod 851), 


2° = 478 (mod 851). 
Then 
2850 = 9292"92°92"92" (mod 851) 
= 4.9-604-238-478 (mod 851) 
= 16941 (mod 581), 


and so 851 is composite. To factor 851, we observe that 851 + 49 = 900, 
and so 


851 = 900 — 49 = 30? — 7? = (30 — 7)(30 + 7) = 23 - 37. 
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(To understand this factoring method, see Exercise 2.) 

This test can prove that an integer is composite, but it cannot prove 
that an integer is prime. For example, consider the composite number n = 
341 = 11-31, Choosing base b = 2, we have 


2*°=1 (mod 11), 


and so a 
= (2) Sa anods i), 
Similarly, 
2°=1 (mod 31), 
and so 


pet (9) Sis “(nod Bi) 


Since 2°4° — 1 is divisible by both 11 and 31, it is divisible by their product, 
that is, 
2°49 =1 (mod 341). 


A composite number n is called a pseudoprime to the base b if (b,n) = 1 
and b&"-'=1 (mod n). Thus, 341 is a pseudoprime to base 2. 
We can show that 341 is composite by choosing the base b = 7. Since 


7° = 343 =2 (mod 341) 


and 
2'° — 1024=1 (mod 341), 
it follows that 


7340 Sea (7°) 113 


7-2''8 (mod 341) 
7-23 (2!0)"' (mod 341) 
= 56 (mod 341) 

# 1 (mod 341). 


Can every composite number be proved composite by some primality 
test based on Fermat’s theorem? It is a surprising fact that the answer is 
“no.” There exist composite numbers n that cannot be proved composite 
by any congruence of the form b"~! (mod _n) with (b,n) = 1. For example, 
561 = 3-11-17 is composite. Let b be an integer relatively prime to 561. 
Then 

b?=1 (mod 3), 


and so ae 
p69 — (b7) =1 (mod 3). 
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Similarly, 
b'°=1 (mod 11), 
and so Zs 
pP°? = (b'°)" =1 (mod 11). 
Finally, 
b'©=1 (mod 17), 
and so 


p60 — (516)? = 1 (mod 17). 


Since b°°° — 1 is divisible by 3, 11, and 17, it is also divisible by their 
product, hence 
b°°° =1 (mod 561). 


This proves that 561 is a pseudoprime to base b for every 6 such that 


(b,n) = 1. 
A Carmichael number is a positive integer n such that n is composite 
but 6°"! =1 (mod n) for every integer b relatively prime to n. Thus, 561 


is a Carmichael number. 


Exercises 


1. Prove that 589 is composite by computing the least nonnegative 
residue of 2°88 (mod 589). 


2. Let n be an odd integer, n > 3. Prove that there exists a nonnegative 
integer u such that n+u? = (u+1)?. Prove that n is composite if and 
only if there exist nonnegative integers u and v such that v > u+1 
and n+ u? = v?. Use this method to factor 589. 


. Prove that 645 is a pseudoprime to base 2. 
. Prove that 1729 is a pseudoprime to bases 2, 3, and 5. 


. Prove that 1105 is a Carmichael number. 


D oh FF WwW 


. Let n be a product of distinct primes. Prove that if p—1 divides n—1 
for every prime p that divides n, then n is a Carmichael number. 


7. Prove that 6601 is a Carmichael number. 


2.7 Public Key Cryptography 


Cryptography is the art and science of sending secret messages. The message 
that we want to send is called the plaintext. The sender uses a key to 
encipher, or encrypt, it into ciphertezt, and the ciphertext is transmitted 
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to the receiver, who uses another key to decipher, or decrypt, it back into 
plaintext. By writing letters and punctuation marks as numbers, we can 
assume that the plaintext is a positive integer P, and that it is encrypted 
as a different positive integer C. The problem is to invent keys that make 
it impossible or computationally infeasible for an enemy to decipher an 
intercepted message. Cryptanalysis is the art and science of deciphering an 
intercepted message without knowledge of the decrypting key. 

Classically, cryptography uses secret keys that are known only to sender 
and receiver. If the enemy discovers the encrypting key and intercepts the 
ciphertext, then he might be able to compute the decrypting key and re- 
cover the plaintext. 

Here is an example of a secret key cryptosystem. Let p be an odd prime, 
and let e be an integer such that (e,p —1) = 1. Suppose that the plaintext 
P is an integer such that 0 < P < p. Let the ciphertext C' be the least 
nonnegative residue of P*® modulo p, that is, we construct C' by the rule 


C=P*° (mod p) 


and 
0<C<p. 


The encrypting key for this cipher consists of the prime number p and the 
integer e. To decrypt this cipher, we use elementary number theory. Since 
(e,p — 1) = 1, there exists an integer d such that ed =1 (mod p-—1). It 
is easy to compute d. We can use the Euclidean algorithm, for example. 
The decrypting key consists of the prime p and the integer d. Since ed = 
1+(p—1)k for some integer k, and since PP~'=1 (mod p) by Fermat’s 
theorem, it follows that 


Cts pts plre-Dk = pipe)" =P (mod p). 


Thus, we can decrypt the ciphertext C' by computing the least nonnegative 
residue of C? modulo p. An enemy who learns the encrypting key will break 
the cipher. 
For example, if p = 17 and e = 8, then the plaintext P = 10 is encrypted 
as 
P? = 10°? =14 (mod 17), 


and so the ciphertext is C = 14. Since 3-11 =1 (mod 16), it follows that 
d= 11 is a decrypting key. We observe that 


Cc’ =14"=10=P (mod 17). 


There is a more sophisticated idea in cryptography that produces secure 
ciphers even if the encrypting key is known. Indeed, the encrypting key can 
be made public, so that anyone can encrypt and send a message, but the 
decrypting key cannot be computed from knowledge of the encrypting key. 
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This is called a public key cryptosystem. Here is an example. We choose 
two different large primes p and q, and let 


Since we know p and q, it is easy to calculate y(m) = (p—1)(q—1). Pick an 
integer e that is relatively prime to y(m). We publish the numbers m and e. 
The plaintext must be a positive integer P that is less than m and relatively 
prime to m If m is a large number, then almost all positive integers less 
than m are relatively prime to m (Exercise 4), so we can assume that 
(P,m) = 1. The ciphertext will be the unique integer C such that 


C=P®* (mod m) 


and 
0<C<m. 


It is important to note that we disclose neither y(m) nor the prime factors 
p and q of m. These are kept secret. However, since we know y(m), it is 
easy, by using the Euclidean algorithm, for example, to compute an integer 
d such that 

ed=1 (mod ¢(m)), 


that is, 
ed=1+ y(m)k 


for some integer k. To decrypt the ciphertext C, we simply compute the 
least nonnegative residue of 


C* (mod m). 
Since (P,m) = 1, Euler’s theorem tells us that 
Ct pet = plenk — P (modim), 


The decryption key requires the integers d and m. It is not enough to 
know e and m. To compute d, one must know both e and y(m). Since 
y(m) = (p— 1)(q — 1), this requires a knowledge of the primes p and q 
such that m = pq, that is, we must be able to factor m. If the primes p 
and gq are large (such as several thousand digits each), then it is impossible 
with state-of-the-art computer hardware and our current knowledge about 
factoring large numbers to find the prime factors of m in a reasonable time, 
for example, a million years. We know the prime factors p and qg, and so we 
can compute y(m), but an opponent who wants to intercept and decrypt 
the message will fail, since he does not know the primes and cannot factor 
m. Indeed, the following result shows that knowing y(m) is equivalent to 
knowing the prime factors of m. 
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Theorem 2.19 Let m be an integer that is the product of two prime num- 
bers. The prime divisors of m are the roots of the quadratic equation 

x? —(m+1-—9(m))z+m=0, 
and so y(m) determines the prime factors of m. 


Proof. If m = pq, then 
m 
A a a) i a 2 a a a a ae ear 


and so m 
p—(m+1—y(m)) + i =0. 
Equivalently, p and q are the solutions of the quadratic equation 


z? —(m+1—9(m))z+m=0. 


This completes the proof. 


For example, if m = 221 and y(m) = 192, then the quadratic equation 
x” — 30x + 221 =0 


has solutions x = 13 and x = 17, and 221 = 13-17. 

This method, known as the RSA cryptosystem, is called a public key cryp- 
tosystem, since the encryption key is made available to everyone, and the 
encrypted message can be transmitted through public channels. Only the 
possessor of the prime factors of m can decrypt the message. RSA is simple, 
but useful, and is the basis of many commercially valuable cryptosystems. 


Exercises 


1. Consider the secret key cryptosystem constructed from the prime 
p = 947 and the encoding key e = 167. Encipher the plaintext P = 2. 
Find a decrypting key and decipher the ciphertext C = 3. 


2. Consider the primes p = 53 and q = 61. Let m = pg. Prove that 
e = 7 is relatively prime to y(m). Find a positive integer d such that 
ed=1 (mod y(m)). 


3. The integer 6059 is the product of two distinct primes, and y(6059) = 
5904. Use Theorem 2.19 to compute the prime divisors of 6059. 


4. The probability that an integer chosen at random between 1 and n is 
relatively prime to n is y(n)/n. Let n = pq, where p and q are distinct 
primes greater than x. Prove that the probability that a randomly 
chosen positive integer up to z is relatively prime to n is greater than 
(1 —1/a)?. If x = 200, this probability is greater than 0.99. 
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Si numerus a numerorum 0, c differentiam metitur, b et c secun- 
dum a congrui dicuntur, sin minus, incongrut: ipsum a modu- 
lum appellamus. Uterque numerorum 80, c priori in casu alterius 
residuum, in posteriori vero nonresiduum vocatur. 


C. F. Gauss [37] 


This is the first paragraph in the first section of Gauss’s Disquisitiones 
Arithmeticae, a seminal book on number theory that was published in 1801. 
The translation, with slight changes in notation, is the first paragraph of 
this chapter. Gauss introduced the idea of congruence, and proved many 
of the results on congruences that we obtain in this book. This is classical 
mathematics that every student of mathematics should learn. 

Carmichael conjectured in 1912 that the number of Carmichael numbers 
is infinite. Alford, Granville, and Pomerance [1] confirmed this in 1994. 
They proved that if C(x) is the number of Carmichael numbers less than z, 
then C(x) > x?/7 for all sufficiently large a. Erdés has made the stronger 
conjecture that for every ¢ > 0 there exists a number 2o(¢) such that 
C(x) > x!~* for all 2 > xo(e). For an expository article on primality 
testing and Carmichael numbers, see Granville [40]. 

There is a vast literature on applications of number theory to cryptogra- 
phy, but it is hard to assign credit for discoveries in this field, because much 
of the research is carried out in secret at government agencies responsible 
for communications security, and not published in unclassified scientific 
journals. For example, the idea of public key cryptography first appeared 
in the public domain in work of Diffie, Hellman, and Merkle [26, 65] in 1976. 
The RSA cryptosystem was invented and published by Rivest, Shamir, and 
Adleman[123] in 1978. Singh [135] has reported, however, that both the 
concept of public key cryptography and the RSA cryptosystem were dis- 
covered earlier by three British government cryptographers, James Ellis, 
Clifford Cocks, and Malcolm Williamson, working at Government Com- 
munications Headquarters (GCHQ) in Cheltenham, England. It is possible 
that government cryptographers in other countries also independently dis- 
covered these methods. 

Boneh [12] is a recent survey of the status of the RSA cryptosystem. 
In 1997, Shor [133] described an algorithm based on ideas from quantum 
mechanics that would factor large integers in “polynomial time,” that is, 
much faster than is now possible with classical algorithms and comput- 
ers. If it becomes possible to build quantum computers, then cryptography 
based on the difficulty of factoring large integers would become insecure 
and unreliable. For a review of classical computing, quantum computing, 
and Shor’s factoring algorithm, see Manin [95]. Information on quantum 
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computing is available on the internet from the University of Oxford’s Cen- 
ter for Quantum Computing (www.qubit.org). 

A good text on number theoretic cryptography is Koblitz, A Course in 
Number Theory and Cryptography [83]. 


3 


Primitive Roots and Quadratic 
Reciprocity 


3.1 Polynomials and Primitive Roots 


Let m be a positive integer greater than 1, and a an integer relatively 
prime to m. The order of a modulo m, denoted by ord,,,(a), is the smallest 
positive integer d such that a’ =1 (mod m). By Theorem 2.14, ord;,(a) 
is a divisor of the Euler phi function y(m). The order of a modulo m is 
also called the exponent of a modulo m. 

We investigate the least nonnegative residues of the powers of a modulo 
m. For example, ifm = 7 and a = 2, then 


2° = 1 (mod 7), 
2+ = 2 (mod 7), 
2? = 4 ‘(mod 7), 
oe & FT Gned 2): 
and 2 has order 3 modulo 7. If m = 7 and a = 3, then 
3° = 1 (mod 7), 
31 = 8 (mod 7), 
3? = 2 (mod 7), 
33 = 6 (mod 7), 
34 = 4 (mod 7), 
3° = 5 (mod 7), 
3° = 1 (mod 7), 
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and 3 has order 6 modulo 7. The powers of 3 form a reduced residue system 
modulo 7. 

The integer a is called a primitive root modulo m if a has order y(m). In 
this case, the y(m) integers 1,a,a?,...,a°°—! are relatively prime to m 
and are pairwise incongruent modulo m. Thus, they form a reduced residue 
system modulo m. For example, 3 is a primitive root modulo 7. Similarly, 
3 is a primitive root modulo 10, since y(10) = 4 and 


3° =1 (mod 10), 
3! =3 (mod 10), 
37 =9 (mod 10), 
3° =7 (mod 10), 
34 =1 (mod 10) 


Some moduli do not have primitive roots. There is no primitive root 
modulo 8, for example, since y(8) = 4, but 


17=3?=5°=77=1 (mod 8), (3.1) 


and no integer has order 4 modulo 8. 

In this section we prove that every prime p has a primitive root. In 
Section 3.2 we determine all composite moduli m for which there exist 
primitive roots. 

We begin with some remarks about polynomials. Let R be a commutative 
ring with identity. A polynomial with coefficients in R is an expression of 
the form 


F(a) = aaa + Am—10 1 +++» + art ao, 


where ao, @1,.-.,@m € R. The element a; is called the coefficient of the 
term a’. The degree of the polynomial f(x), denoted by deg(f), is the 
greatest integer n such that a, 4 0, and a, is called the leading coefficient. 
If deg(f) = n, we define a; = 0 for « > n. Nonzero constant polynomials 
f(x) = ao # 0 have degree 0. The zero polynomial f(a) = 0 has no degree. 
A monic polynomial is a polynomial whose leading coefficient is 1. 

We define addition and multiplication of polynomials in the usual way: 
If f(x) = Dp aia* and g(x) = 7g bye’, then 


max(m,n) 


(f+9)(@)= SY) (ae tbe)a* 


k=0 
and 


fg(a) = Sy ie 
k=0 
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where 


k 
Ch = ) aid; = ) ajbe_j- 
i=0 


itjsk 
O0<i<n 
0<j<m 


With this addition and multiplication, the set R[2] of all polynomials with 
coefficients in R is a commutative ring. Moreover, 


deg(f +g) < max(deg(f), deg(g)). 


If f,g € F[z] for some field F’, then 


deg(fg) = deg(f) + deg(g), 


and the leading coefficient of fg is Gmbn. 
For every a € R, the evaluation map O,q : Ri[a] — R defined by 


Oa(f) = f(a) = ana” + ana"! + +++ + a,a + a9 


is a ring homomorphism, that is, (f + g)(a) = f(a) + g(a) and (fg)(a) = 
f(a)g(a). The element a is called a zero or a root of the polynomial f(x) 
if O.(f) = f(a) =0. 

We say that the polynomial d(x) divides the polynomial f(x) if there 
exists a polynomial q(x) such that f(x) = d(x)q(2). 


Theorem 3.1 (Division algorithm for polynomials) Let F be a field. 
If f(x) and d(x) are polynomials in F\a| and if d(x) 4 0, then there exist 
unique polynomials q(a) and r(x) such that f(x) = d(x)q(a) + r(x) and 
either r(x) = 0 or the degree of r(x) is strictly smaller than the degree of 
d(x). 


Proof. Let d(x) = b,x” +--+ + bi” + bo, where b,, # 0 and deg(d) = 
m. If d(x) does not divide f(x), then f — dq # 0 and deg(f — dq) isa 
nonnegative integer for every polynomial g(x) € Fa]. Choose q(x) such 
that @ = deg(f — dq) is minimal, and let 


r(a) = f(x) — d(x)q(x) = cea ++ +e +0 € Fla], 


where ce 4 0. We shall prove that £< m. 
Since F is a field, b;,' € F. If £>m, then 


d(x) bot cent ™ 


is a polynomial of degree @ with leading coefficient ce. Then 


Q(x) = a(x) + by, cen” € Fla, 
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and 


R(x) = f(x) —d(x)Q(z) 
= f(x)—-d(z) (q(x) + be eae) 


= r(x) —d(x)b>)cex*—™ 


is a polynomial of degree at most @— 1. This contradicts the minimality of 
é, and so £< m. 
Next we prove that the polynomials g(x) and r(x) are unique. Suppose 
that 
f(x) = d(x)qi(x) + ri(x) = d(x)qa(x) + r2(2), 


where qi(x), g2(), 71(Z), r2(x) are polynomials in F'[a] such that r;(a) = 0 
or deg(r;) < deg(d) for i = 1,2. Then 


d(x)(qi(@) — g2(@)) = ro(x) — ri (2). 
If q(x) ~ qo(x), then 
deg(d) < deg(d(q — qz2)) = deg(r2 — 1) < deg(d), 


which is absurd. Therefore, qi(z) = q2(x), and so r(x) = ro(x). This 
completes the proof. 


Theorem 3.2 Let f(x) € Fla], f(x) 40, and let No(f) denote the number 
of distinct zeros of f(x) in F. Then No(f) does not exceed the degree of 
f(a), that is, 

No(f) < deg(f). 


Proof. We use the division algorithm for polynomials. Let a € F. Di- 
viding f(x) by «— a, we obtain 


f(a) = (@— a)q(a) + r(a), 


where r(x) = 0 or deg(r) < deg(a—a) = 1, that is, r(x) = ro is a constant. 
Letting x = a, we see that ro = f(a), and so 


f(x) = (@ — a)q(a) + f(a) 


for every a € F. In particular, if a is a zero of f(x), then « — a divides 
(0). 

We prove the theorem by induction on n = deg(f). If n = 0, then f(z) 
is a nonzero constant and No(f) = 0. If n = 1, then f(x) = ap + aye 
with a; # 0, and No(f) = 1 since f(x) has the unique zero a = —a, ao. 
Suppose that n > 2 and the theorem is true for all polynomials of degree 
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at most n—1. If No(f) = 0, we are done. If No(f) > 1, let a € F be a zero 
of f(a). Then 


and 
deg(q) =n-1. 


If 8 is a zero of f(a) and 6 4 a, then 


0= f(8) = (6 — a)a(9), 


and so ( is a zero of q(x). Since deg(q) = n — 1, the induction hypothesis 
implies that 
No(f) < 1+ No(q) < 1+ deg(q) = n. 


This completes the proof. 


Theorem 3.3 Let G be a finite subgroup of the multiplicative group of a 
field. Then G is cyclic. 


Proof. Let |G] = m. By Theorem 2.15, if a € G, then the order of a 
is a divisor of m. For every divisor d of m, let w(d) denote the number 
of elements of G of order d. If w(d) # 0, then there exists an element a 
of order d, and every element of the cyclic subgroup (a) generated by a 
satisfies at = 1. By Theorem 3.2, the polynomial f(x) = x4—1 € F[z] has 
at most d zeros, and so every zero of f(a) belongs to the cyclic subgroup 
(a). In particular, every element of G of order d must belong to (a). By 
Theorem 2.18, a cyclic group of order d has exactly y(d) generators, where 
y(d) is the Euler phi function. Therefore, w(d) = 0 or ~(d) = y(d) for 
every divisor d of m. Since every element of G has order d for some divisor 
d of m, it follows that 


By Theorem 2.8, 


> ¢(d) =m, 


and so w(d) = y(d) for every divisor d of m. In particular, w(m) = y(m) > 
1, and so G is a cyclic group of order m. 


Theorem 3.4 For every prime p, the multiplicative group of the finite field 
Z/pZ is cyclic. This group has y(p—1) generators. Equivalently, for every 
prime p, there exist y(p — 1) pairwise incongruent primitive roots modulo 
p. 
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Proof. This follows immediately from Theorem 3.3, since |(Z/pZ)*| = 
p-l. 


The following table lists the primitive roots for the first six primes. 


p | p(p —1) | primitive roots 


5 
7 
11 
13 


et NOE NO el el 


Let p be a prime, and let g be a primitive root modulo p. If a is an integer 
not divisible by p, then there exists a unique integer & such that 


a=g" (mod p) 


and 
k € {0,1,...,p—2}. 


This integer & is called the index of a with respect to the primitive root g, 
and is denoted by 


k = ind,(a). 
If k, and kg are any integers such that k, < ko and 
a=g" =g* (mod p), 
then 
gj?" =1 (mod p), 
and so 


ky = ko (mod p— 1). 


Ifa=g* (mod p)andb=g‘’ (mod p), then ab= g*g’ = g*t* (mod p), 
and so 
ind,(ab) =k += ind,(a)+ind,(b) (mod p—1). 
The index map ind, is also called the discrete logarithm to the base g 
modulo p. 
For example, 2 is a primitive root modulo 13. Here is a table of ind2(a) 
fora=1,...,12: 


a | indg(a) || @ | indg(a) 
1 0 7 11 
2 1 8 3 
3 4 9 8 
4 2 10 10 
5 9 11 7 
6 5 12 6 
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By Theorem 2.18, if g is a primitive root modulo p, then g* is a primitive 
root if and only if (k, p—1) = 1. For example, for p = 13 there are y(12) = 4 
integers k such that 0 < k < 11 and (k,12) = 1, namely, & = 1,5,7,11, and 
so the four pairwise incongruent primitive roots modulo 13 are 


yg 2 (mod 13), 
2° = 6 (mod 13), 
2° = 11 (mod 13), 
241 = 7 (mod 13). 


Exercises 


1. 


eo on nwn FF WoO WD 


a 
So 


11. 


12. 


Find a primitive root modulo 23. 


. Find a primitive root modulo 41. 

. Prove that 2 is a primitive root modulo 101. 

. Compute ind2(27) modulo 101. 

. Compute ind2(19) modulo 101. 

. What is the order of 3 modulo 101? Is 3 a primitive root modulo 101? 
. Prove that 2 is a primitive root modulo 53. 

. Find all solutions of the congruence 2* = 22 (mod 53). 

. Compute indg(a) for all a not divisible by 53. 


. Let p be an odd prime, and let g be a primitive root modulo p. Prove 


that 
(p= Dis 9? 7" = =): (med p); 


Hint: Observe that 


2 


(p—1)!=1-g-g?-+-gP? 


-g? “(mod p) 


and 


(P= 2p Ap - ppt) 
5 =e (p= 1). 


This gives another proof of Wilson’s theorem (Theorem 2.4). 


Prove that if m has one primitive root, then there are exactly y(y(m)) 
pairwise incongruent primitive roots modulo m. 


Let g and r be primitive roots modulo p. Prove that 
ind,(a) = ind,(a)ind,(g) (mod p— 1) 


for every integer a relatively prime to p. 
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13. Let g be a primitive root modulo the odd prime p. Prove that g?~))/? = 
—1 (mod p). 
14. Let g be a primitive root modulo the odd prime p. Prove that —g is 


a primitive root modulo p if and only ifp=1 (mod 4). 


15. Let f(x) = S07, aiz* and g(x) = Sci.) bia’ be polynomials with 
integer coefficients. Then f(x) and g(a) are called congruent modulo 
m, written f(x) = g(x) (mod m), if aj = 6; (mod m) for i = 
0,1,...,n. Let p be an odd prime, and let 


fe) =a 1 
and 
g(x) = (« —1)(@— 2)---(@- (p— 1). 
Prove the following statements: 
(a) The polynomial f(x) — g(a) has degree p — 2. 
(b) 
f(c) =g(c)=0 (mod p) forc=1,2,...,p—1. 
(c) 
f(x) = g(@) (mod p). 
Hint: Apply Theorem 3.2. 
16. Prove that Exercise (15c) implies Wilson’s theorem, 
(p —1)!=—-1 (mod p). 
17. Prove that for every prime p > 5, 
ij =0 (mod p) 
1<i<j<p-l 


and 
S- ijk =O (mod p). 
1<i<j<k<p-1 
Hint: Exercise (15c). 


18. Let R be acommutative ring with identity. An ideal of Ris an additive 
subgroup I C R such that, if a € I and r € R, then ar € I. Prove 
that if J A {0} is an ideal of the polynomial ring Fz], where F is a 
field, then there is a unique monic polynomial d(a) € I such that I 
consists of all multiples of d(x), that is, 


T= {q(w)d(x) : q(x) € Fla]}. 


Hint: If I # {0}, choose d(x) € I of minimal degree. The proof is 
similar to the proof of Theorem 1.3. 


3.2 Primitive Roots to Composite Moduli 91 


19. Prove that the intersection of a family of ideals is an ideal. This means 
that if {I;}je7 is a family of ideals in the ring R, then I = (),., I; 
is an ideal in R. 


JET 


20. Let Fa] be the ring of polynomials with coefficients in the field F, 
and let f(x),g(a) € Fla]. Prove that there exists a unique monic 
polynomial d(x) € Fa] such that d(x) divides both f(x) and g(z), 
and every common divisor of f(x) and g(a) divides d(x). The poly- 
nomial d(x) is called the greatest common divisor of f(a) and g(x). 


Hint: Consider the ideal I generated by f(a) and g(x), that is, the 
set 


T= {u(x) f(a) + v(a)g(a) : u(x), v(x) € Fla}, 


and apply Exercise 18. 


21. Let f : R— S bea ring homomorphism. Prove that the kernel of f, 
that is, the set 


f-'(0) = {re R: f(r) = 0} 


is an ideal of R. 


22. Let a € F, and let I(a) be the set of all polynomials f(x) € Fla] 
such that f(a) = 0. Prove that I(a) is the kernel of the evaluation 
map 0, and that I(a) is an ideal of F[z]. 


23. Let A be a nonempty subset of F, and let I(A) be the set of all 
polynomials f(a) € F[a] such that f(a) =0 for all a € A. Prove that 
I(A) is an ideal of Fa], and 
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In the previous section we proved that primitive roots exist for every prime 
number. We also observed that primitive roots do not exist for every mod- 
ulus. For example, congruence (3.1) shows that there is no primitive root 
modulo 8. The goal of this section is to prove that an integer m > 2 has a 
primitive root if and only if m = 2,4,p*, or 2p", where p is an odd prime 
and k is a positive integer. 


Theorem 3.5 Let m be a positive integer that is not a power of 2. If m 
has a primitive root, then m = p® or 2p*, where p is an odd prime and k 
18 a positive integer. 
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Proof. Let a and m be integers such that (a,m) = 1 and m > 3. Suppose 
that 


m=mym2, where (m,,m2) = 1 and m1 > 3, m2 > 3. (3.2) 


Then (a,m,) = (a,m2) = 1. The Euler phi function y(m) is even for m > 3 
(Exercise 4 in Section 2.2). Let 


By Euler’s theorem, 
a?) =1 (mod mj), 


and so aye 
y(m 
a” = (arm) } os! (mod my). 


Similarly, 
(m1)/2 
a” = (avers) )" w= (mod mz). 


Since (m1, m2) = 1 and m = m1mzg, we have 
a” =1 (mod m), 


and so the order of a modulo m is strictly smaller than y(m). Consequently, 
if we can factor m in the form (3.2), then there does not exist a primitive 
root modulo m. In particular, if m is divisible by two distinct odd primes, 
then m does not have a primitive root. Similarly, if m = 2°p*, where ¢ > 2, 
then m does not have a primitive root. Therefore, the only moduli m 4 2° 
for which primitive roots can exist are of the form m = p* or m = 2p* for 
some odd prime p. 


To prove the converse of Theorem 3.5, we use the following result about 
the exponential increase in the order of an integer modulo prime powers. 


Theorem 3.6 Let p be an odd prime, and let a 4 +1 be an integer not 
divisible by p. Let d be the order of a modulo p. Let ko be the largest integer 
such that at = 1 (mod p*°). Then the order of a modulo p* is d for 
k=1,...,ko and dp*—" fork > ko. 


Proof. There exists an integer uo such that 
at =1+ pug and (uo, p) = 1. (3.3) 


Let 1 < k < ko, and let e be the order of a modulo p*. Ifa® =1 (mod p*), 
then a® = 1 (mod p), and so d divides e. By (3.3), we have a? = 1 
(mod p*), and so e divides d. It follows that e = d. 
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Let 7 > 0. We shall show that there exists an integer wu; such that 
dp = 1 jtko,,. . — 
ar = 14+ pu; and (u;,p) =1. (3.4) 


The proof is by induction on j. The assertion is true for 7 = 0 by (3.3). 
Suppose we have (3.4) for some integer 7 > 0. By the binomial theorem, 
there exists an integer v; such that 


+1 


aiP = ce + pithoy;)P 


= 14 pitty, + s Cee 
1=2 

= Lp pittthoy, 4 pitrthoy, 

= 1+ pit (us + pos) 

Se iss, 


and the integer u;+1 = uj + pv; is relatively prime to p. Thus, (3.4) holds 
for all 7 > 0. 

Let k > ko +1 and 7 = k—ko > 1. Suppose that the order of a modulo 
p*—! is dpi—1. Let e, denote the order of a modulo p*. The congruence 


a®* =1 (mod p*) 


implies that 


and so dp’—! divides e,. Since 


a = 1 + p*~tuj-1 #1. (mod p*), 
it follows that dp’—! is a proper divisor of e,. On the other hand, 
a? =1 + peu; =1 (mod p*), 


and so Ck divides dp’. It follows that the order of a modulo p* is exactly 
ex, = dp’ = dp*—*©. This completes the proof. 


Theorem 3.7 Let p be an odd prime. If g is a primitive root modulo p, 
then either g or g+p is a primitive root modulo p® for all k > 2. If g is a 
primitive root modulo p* and g, € {g,g+p*} is odd, then g, is a primitive 
root modulo 2p*. 


Proof. Let g be a primitive root modulo p. The order of g modulo p 
is p—1. Let ko be the largest integer such that p*° divides g?~! — 1. By 
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Theorem 3.6, if ko = 1, then the order of g modulo p* is (p—1)p*~! = y(p*), 
and g is a primitive root modulo p* for all k > 1. 
If ko > 2, then 
gt =1tpey 


for some integer v. By the binomial theorem, 


p-1 = pot p-1-i,i 
(9 +P) =» , jo 


i=0 
= g?*+(p—1)g?-*p (mod p’) 
= 1+p’v+g?*p*—g?*p (mod p”) 
1—g°~*p (mod p’”) 
# 1 (mod p’). 


Then g + p is a primitive root modulo p such that 


(g+p)?-1=1+4 puo and (wo, p) = 1. 


Therefore, g + p is a primitive root modulo p* for all k > 1. 

Next we prove that primitive roots exist for all moduli of the form 2p”. If 
g is a primitive root modulo p*, then g+p* is also a primitive root modulo 
p®. Since p® is odd, it follows that one of the two integers g and g + p* is 
odd, and the other is even. Let g, be the odd integer in the set {g,g+p*}. 
Since (g + p*, p*) = (g, p*) = 1, it follows that (g1,2p*) = 1. The order of 
gi modulo 2p" is not less than y(p*), which is the order of g; modulo p*, 
and not greater than y(2p"). However, since p is an odd prime, we have 


y(2p*) = y(p*), 


and so g; has order y(2p*) modulo 2p*, that is, g; is a primitive root 
modulo 2p". This completes the proof. 


For example, 2 is a primitive root modulo 3. Since 3 is the greatest power 
of 3 that divides 2? — 1, it follows that 2 is a primitive root modulo 3” for 
all k > 1, and 2 + 3" is a primitive root modulo 2- 3° for all k > 1. 

Finally, we consider primitive roots modulo powers of 2. 


Theorem 3.8 There exists a primitive root modulo m = 2" if and only if 
m= 2 or 4. 


Proof. We note that 1 is a primitive root modulo 2, and 8 is a primitive 
root modulo 4. We shall prove that if k > 3, then there is no primitive root 
modulo 2*. Since y(2") = 2°~1, it suffices to show that 


gk-2 


a 21. (mod 2) (3.5) 
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for a odd and k > 3. We do this by induction on k. The case k = 3 is 
congruence (3.1). Let k > 3, and suppose that (3.5) is true. Then 


a” 1 

is divisible by 2". Since a is odd, it follows that 
ae +1 

is even. Therefore, 


Py ia a (a = 1) (Gar 4 1) 


is divisible by 2*+1, and so 


This completes the induction and the proof of theorem. 


Let k > 3. By Theorem 3.8, there is no primitive root modulo 2", that 
is, there does not exist an odd integer whose order modulo 2" is 2*7!. 
However, there do exist odd integers of order 2*~? modulo 2*. 


Theorem 3.9 For every positive integer k, 
5 S43: 0"?? (mod 2°"), 
Proof. The proof is by induction on k. For k = 1 we have 
5? =25=143-23 (mod 2°). 
Similarly, for k = 2 we have 
5? = 625 =1+448+576 =1+3-24 (mod 2°). 


If the theorem holds for k > 1, then there exists an integer u such that 


5? = 143-2842 4 okt4y = 14 24+2(3 4 dy). 
Since 2k + 4 >k+5, we have 


gore = Ce): 
= (1+24(3 4 4u))” 


= 142**3(344u) (mod 2?*+*) 
= 143-2"? (mod 2°"). 


This completes the proof. 
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Theorem 3.10 If k > 3, then 5 has order 2*~? modulo 2*. Ifa = 1 
(mod 4), then there exists a unique integer i € {0,1,...,2*~? — 1} such 
that 

a=5' (mod 2°). 
Ifa=3 (mod 4), then there exists a unique integer i € {0,1,...,2*-?—-1} 
such that 

a@=-—5* (mod 2°). 


Proof. In the case k = 3, we observe that 5 has order 2 modulo 8, and 


1 = 5° (mod 8), 
3 = =—5) (mod 8), 
5 = 51 (mod 8), 
7 = -5° (mod 8). 
Let k > 4. By Theorem 3.9, we have 
So oe Pee Gaed okt?) 
= 1 (mod 2*) 
and 
Ber ee ES RSL. Tusa Beet) 


= 143-2"! (mod 2°) 

# 1 (mod 2*), 
Therefore, 5 has order exactly 2*-? modulo 2", and so the integers 5’ are 
pairwise incongruent modulo 2* for i = 0,1,...,2*~? — 1. Since 5’ = 1 
(mod 4) for all i, and since exactly half, that is, 2’~?, of the 2*-1 odd 
numbers between 0 and 2* are congruent to 1 modulo 4, it follows that the 
congruence 

5} =a (mod 2° 

is solvable for every a = 1 (mod 4). Ifa = 3 (mod 4), then —a = 1 
(mod 4) and so the congruence 


—a=5' (mod 2*), 


or, equivalently, 
a=—5' (mod 2°), 


is solvable. This completes the proof. 


In algebraic language, Theorem 3.10 states that for all k > 3, 
(Z/2*Z)* = (-1) x (5) & Z/2Z x Z/2*-°Z, 


where (a) denotes the cyclic subgroup of (Z/2*Z)* generated by a for 
a=-—landa=5. 
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Exercises 


1. Find an integer g that is a primitive root modulo 5” for all k > 1. 
Find a primitive root modulo 10. Find a primitive root modulo 50. 


2. For k > 1, let e, be the order of 5 modulo 3”. Prove that 


ey, = 2-3*-1. 


3. Prove that p divides the binomial coefficient (°) fori =1,2,...,p—1. 


4. Prove that if g is a primitive root modulo p?, then g is a primitive 
root modulo p* for all k > 2. 


5. Let p be an odd prime. Prove that 
(1+ px)?" =1+p*t4x (mod p*t?) 
for every integer x and every nonnegative integer k. 


6. (Nathanson [100]; see also Wagstaff [151]) Let p be an odd prime, 
and let a £ +1 be an integer not divisible by p. Let d be the order 
of a modulo p, and let ko be the largest integer such that a4 = 
1 (mod p**), Prove that if k > ko is a solution of the exponential 


congruence 
a®=1 (mod p*), (3.6) 
then 
k d 
D a 
a i 


and so congruence (3.6) has only finitely many solutions. 
Hint: Apply Theorem 3.6. 


7. Use Exercise 6 to prove that the exponential congruence 
9§=1 (mod 7*) 
has no solutions. 
8. Find all solutions of the exponential congruence 


17*=1 (mod 15*). 


9. Find all solutions of the exponential congruence 


3*=1 (mod 2*). 
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10. Let {x} denote the fractional part of x. Compute 


forn =1,...,10. Let r,, be the least nonnegative residue of 3” modulo 


2”. Show that 
3\" Py 
2 — gn 


Remark. It is an important unsolved problem in number theory to 
understand the distribution of the fractional parts of the powers of 
3/2 in the interval [0, 1). 


3.3. Power Residues 


Let m,k, and a be integers such that m > 2, k > 2, and (a,m) = 1. We 
say that ais a kth power residue modulo m if there exists an integer 7 such 
that 

a* =a (mod m). 
If this congruence has no solution, then a is called a kth power nonresidue 
modulo m. 

Let k = 2 and (a,m) = 1. If the congruence x? = a (mod m) is solv- 
able, then a is called a quadratic residue modulo m. Otherwise, a is called a 
quadratic nonresidue modulo m. For example, the quadratic residues mod- 
ulo 7 are 1,2, and 4; the quadratic nonresidues are 3,5, and 6. The only 
quadratic residue modulo 8 is 1, and the quadratic nonresidues modulo 8 
are 3,5,4 and 7. 

Let k = 3 and (a,m) = 1. If the congruence x? = a (mod m) is solvable, 
then a is called a cubic residue modulo m. Otherwise, a is called a cubic 
nonresidue modulo m. For example, the cubic residues modulo 7 are 1 and 
6; the cubic nonresidues are 2, 3, 4, and 5. The cubic residues modulo 5 
are 1, 2, 3, and 4; there are no cubic nonresidues modulo 5. 

In this and the next two sections we investigate power residues modulo 
primes. In Section 3.6 we consider quadratic residues to composite moduli. 


Theorem 3.11 Let p be prime, k > 2, and d = (k,p—1). Let a be an 
integer not divisible by p. Let g be a primitive root modulo p, Then a is a 
kth power residue modulo p if and only if 


indg(a) =0 (mod d) 


if and only if 
a®-))/4 =] (mod p). 
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Ifa is akth power residue modulo p, then the congruence 


z* =a (mod p) (3.7) 


has exactly d solutions that are pairwise incongruent modulo p. Moreover, 
there are exactly (p—1)/d pairwise incongruent kth power residues modulo 
p. 


Proof. Let ¢ = ind,(a), where g is a primitive root modulo p. Congru- 
ence (3.7) is solvable if and only if there exists an integer y such that 


and 
This is equivalent to 

ky = (mod p—1). (3.8) 
This linear congruence in y has a solution if and only if 

ind,(a) =€=0 (mod d), 

where d = (k,p— 1). Thus, the kth power residues modulo p are precisely 
the integers in the (p—1)/d congruence classes g’4+ pZ for i =0,1,...,(p— 
1)/d — 1. Moreover, 


a(P-1)/4 = gle-Né/d = 1 (mod p) 


if and only if 


=0 (mod p—1) 


if and only if 
ind,(a)=£=0 (mod d). 


Finally, if the linear congruence (3.8) is solvable, then by Theorem 2.2 
it has exactly d solutions y that are pairwise incongruent modulo p — 1, 
and so (3.7) has exactly d solutions x = g¥ that are pairwise incongruent 
modulo p. This completes the proof. 


For example, let p = 19 and k = 3. Then d = (k, p—1) = (3,18) = 3. We 
can check that 2 is a primitive root modulo 19, and so a is a cubic residue 
modulo 19 if and only if 3 divides indg(a). Since —1 = 2° (mod 3) and 
ind2(—1) = 9, it follows that —1 is a cubic residue modulo 19. The solutions 
of the congruence 2? = —1 (mod 19) are of the form x = 2¥ (mod 19), 
where 0 < y < 17 and 3y = 9 (mod 18). Then y= 3 (mod 6), and so 
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y = 3,9, and 15. These give the following three cube roots of —1 modulo 
19: 
8=2° (mod 19), 


18=2° (mod 19), 


and 
12=2 (mod 19). 


Corollary 3.1 Let p be an odd prime, and let k > 2 be an integer such 
that (k,p —1) = 1. If (a,p) = 1, then a is a kth power residue modulo p, 
and the congruence x” =a (mod p) has a unique solution modulo p. 


Exercises 
1. Find all cubic residues modulo 19. 


2. Find all solutions of the congruence x? = (mod 19). 


3. Define the map f : (Z/19Z)* — (Z/19Z)* by f(a +19Z) = 23+ 
19Z. Prove that f is a homomorphism of the multiplicative group 
(Z/19Z)*, and compute its kernel. 


4. Find all fifth power residues modulo 11. 
5. Find all sixth power residues modulo 11. 


6. Define the map f : (Z/23Z)* — (Z/23Z)* by f(a+23Z) = x? +23Z. 
Prove that f is a isomorphism of the multiplicative group (Z/23Z)*, 
that is, prove that f is a homomorphism that is one-to-one and onto. 


7. Let xq be the least nonnegative integer such that #3 =a (mod 11). 
Compute x, for a= 1,2,...,10. 


8. Prove that if p=2 (mod 3), then every integer not divisible by p is 
a cubic residue modulo p. 


9. Prove that ifp =1 (mod 6), then the product of the (p—1)/3 cubic 
residues modulo p is congruent to —1 modulo p. 


3.4 Quadratic Residues 


Let p be an odd prime and a an integer not divisible by p. Then a is called 
a quadratic residue modulo p if there exists an integer x such that 


x? =a (mod p). (3.9) 
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If this congruence has no solution, then a is called a quadratic nonresidue 
modulo p. Thus, an integer a is a quadratic residue modulo p if and only 
if (a,p) = 1 and a has a square root modulo p. By Theorem 3.11, exactly 
half the congruence classes relatively prime to p have square roots modulo 


We define the Legendre symbol for the odd prime p as follows: For any 
integer a, 


1 if (a,p) = 1 and a is a quadratic residue modulo p, 
a E Z : ; 
(=) =< —1 if (a,p) = 1 and a is a quadratic nonresidue modulo p, 
P 0 if p divides a. 


The solvability of congruence (3.9) depends only on the congruence class 
ofa (mod p), that is, 


(2) ss (2) ifa=b (mod p), 


and so the Legendre symbol is a well-defined function on the congruence 
classes Z/pZ. 

We observe that if p is an odd prime, then, by Theorem 3.2, the only 
solutions of the congruence x? = 1 (mod p) arex=+1 (mod p). More- 
over, if e,e’ € {-1,0,1} ande =e’ (mod p), then p divides ¢ — e’, and so 


e =e’. In particular, if (3) =e (mod p), then (2) =e. 


Theorem 3.12 Let p be an odd prime. For every integer a, 


(=) =a-/? (mod p). 


Proof. If p divides a, then both sides of the congruence are 0. If p does 
not divide a, then, by Fermat’s theorem, 


2 
(ae-/2) = qe =1 (mod D), 


and so 
q(e-)/2 


=+1 (mod p). 
Applying Theorem 3.11 with k = 2, we have 


a®-Y/? =1 (mod p)_ if and only if (2) =1, 
Pp 
and so 


a®-Y/2 = _1 (mod p)_ if and only if (2) =-1. 
Pp 
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This completes the proof. 


For example, 3 is a quadratic residue modulo the primes 11 and 13, and 
a quadratic nonresidue modulo the primes 17 and 19, because 


3 — 95 — 
(Z)= =1 (mod 11), 


3 — 96 — 
(3) = =1 (mod 13), 
3) 2382-1 (mod 17) 
ive i 


(3) =39=-1 (mod 19). 


The next result states that the Legendre symbol is a completely multi- 
plicative arithmetic function. 


Theorem 3.13 Let p be an odd prime, and let a and b be integers. Then 


ab a b 
Gaol! 
Proof. If p divides a or b, then p divides ab, and 
ie) 2G): 
Pp Pp Pp 
If p does not divide ab, then, by Theorem 3.12, 


(2) = (ab)®-Y/2 (mod p) 


Pp 


q-))/2p(P-1)/2 (mod p) 


= (5)(2) min 


The result follows immediately from the observation that each side of this 
congruence is +1. 


Theorem 3.13 implies that the Legendre symbol (5) is completely de- 
termined by its values at —1, 2, and odd primes gq. If a is an integer not 
divisible by p, then we can write 


_ 19ro 71,72 Tk 
@ = +2"°qn" Ge “dks 
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where qi,..-,Q, are distinct odd primes not equal to p. Then 


Cg URC kk 


We shall first determine the set of primes p for which —1 is a quadratic 
residue. By the following result, this depends only on the congruence class 
of p modulo 4. 


Theorem 3.14 Let p be an odd prime number. Then 
a oe 1 if p=1 (mod 4), 
p) \-1 tf p=3 (mod 4). 


Equivalently, 


Proof. We observe that 


(-1je-Y? — 1 if p=l (mod 4), 
-1 if p=3 


Applying Theorem 3.12 with a = —1, we obtain 


(=) =(-)eY? (mod p). 


Again, the theorem follows immediately from the observation that both 
sides of this congruence are +1. 


Let p be an odd prime, and let S be a set of (p — 1)/2 integers. We call 
Sa Gaussian set modulo p if SU—-S = SU{-s: s € S} is a reduced 
system of residues modulo p. Equivalently, S is a Gaussian set if for every 
integer a not divisible by p, there exist s € S and e € {1,—1} such that 
a = es (mod p). Moreover, s and < are uniquely determined by a. For 
example, the sets {1,2,...,(p—1)/2} and {2,4,6,...,p—1} are Gaussian 
sets modulo p for every odd prime p. If S is a Gaussian set, s,s’ € S, and 


s=+s' (mod p), then s= 5s’. 


Theorem 3.15 (Gauss’s lemma) Let p be an odd prime, and a an in- 
teger not divisible by p. Let S be a Gaussian set modulo p. For every s € S 
there exist unique integers ua(s) € S and €,(s) € {1,—1} such that 

aS = €q(s)uUa(s) (mod p). 
Moreover, 


(<) = [J cals) = (-0)", 


ses 
where m is the number of s € S such that €q(s) = —1. 
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Proof. Since S$ is a Gaussian set, for every s € S there exist unique 
integers ua(s) € S and €(s) € {1,—1} such that 
aS = Eq(s)uUa(s) (mod p). 
Let s,s’ € S. If ua(s) = ua(s’), then 


asl) = Ea(S’)Ua(s’) = Ea(s')Ua(s) (mod p) 
= Ea(8')E€a(S)Ea(S)Ua(s) (mod p) 
= tas (mod p). 


Dividing by a, we obtain 


/ 


s'=+s (mod p), 


and so s’ = s. It follows that the map ug : S — S is a permutation of S, 
and so 


Therefore, 


Dividing by [],-¢ 8, we obtain 


(5) = al?) = TT] cals) (mod p). 


ses 


ses 


The proof is completed by the observation that the right and left sides of 
this congruence are +1. 


We shall use Gauss’s lemma to compute the Legendre symbol (;%). Let 
S be the Gaussian set {2,4,6,8,10}. We have 


3-2 = 6 (mod 11), 
3-4 = (-1)10 (mod 11), 
3-6 = (-1)4 (mod 11), 
3-8 = 2 (mod 11), 
3-10 8 (mod 11). 
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The number of s € S' with ¢3(s) = —1 is m = 2, and so (4) = (-1)?=1, 
that is, 3 is a quadratic residue modulo 11. Tadeee. 

5?=67=3 (mod 11), 
and so 5 and 6 are the square roots of 3 modulo 11. 


Theorem 3.16 Let p be an odd prime. Then 


2\ fil ifp=+1 (mod 8), 
p)  \ -l ifp=+3 (mod 8). 


(=) = (-1)@"-D/8, 


Proof. We apply Gauss’s lemma (Theorem 3.15) to the Gaussian set 
S = {1,2,3,...,(p—1)/2}. Then 


Equivalently, 


{25:5 ¢€ S}= {2,4,6,...,p—1}, 


and 


where m is the number of integers s € S such that ¢2(s) = —1. If 1 <2s< 
(p — 1)/2, then 2s € S, and so u2(s) = 2s and €9(s) = 1. If (p+1)/2 < 
2s <p-—1, then 1 < p— 2s < (p—1)/2, and so p— 2s € S. Since 


2s =—(p—2s) (mod p), 


it follows that u2(s) = p— 2s and €9(s) = —1. Therefore, m is the number 
of integers s € S such that (p+ 1)/2 < 2s < p—1, or, equivalently, 
p+l p—-—l 
es 3.10 
poe ey) 
Since every odd prime p is congruent to 1, 3, 5, or 7 modulo 8, there are 
four cases to consider. 


(i) Ifp=1 (mod 8), then p = 8k +1, and s € S satisfies (3.10) if and 
only if 
1 


and som = 2k and (2) = (—1)?* 
(ii) Ifp =3 (mod 8), then p = 8k+3, and s € S satisfies (3.10) if and 


only if 
2k+1<s<4k+1, 


and som = 2k+1 and (2) = (-1)7**1 = -1. 
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(iii) Ifp=5 (mod 8), then p= 8k+5, and s € S satisfies (3.10) if and 


only if 
1 
2k+1+5 <9 <4k+2, 


and som = 2k+1 and (2) = (—1)?*+1 =e 


(iv) Ifp=7 (mod 8), then p = 8k+7, and s € S satisfies (3.10) if and 


only if 
2k+2< 8s <4k4+3, 


and som = 2k+ 2 and (2) = (—1)?* +? =. 


Finally, we observe that 


and 


This completes the proof. 


=0 (mod2) ifp=lor7 (mod 8) 


p’-1 
8 


=1 (mod2) ifp=3or7 (mod 8). 


Exercises 


1. Find all solutions of the congruences x? = 2 (mod 47) and x? = 2 


(mod 53). 


. Prove that S = {3,4,5,9,10} is a Gaussian set modulo 11. Apply 


Gauss’s lemma to this set to compute the Legendre symbols (+) 


: 11 
and (5) 


. Let p be an odd prime. Prove that {2,4,6,...,p — 1} is a Gaussian 


set modulo p. 


. Use Theorem 3.14 and Theorem 3.16 to find all primes p for which 


—2 is a quadratic residue. 


. Use Gauss’s lemma to find all primes p for which —2 is a quadratic 


residue. 


. Use Gauss’s lemma to find all primes p for which 3 is a quadratic 


residue. 


. Find all primes p for which 4 is a quadratic residue. 
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8. Let p be an odd prime. Prove that the Legendre symbol is a homo- 
morphism from the multiplicative group (Z/pZ)* into {+1}. What 
is the kernel of this homomorphism? 


9. For every odd prime p, define the Mersenne number 
My, = 2? —-1. 


A prime number of the form M, is called a Mersenne prime (see 
Exercise 5 in Section 1.5). 
Let q be a prime divisor of Mp. 
(a) Prove that 2 has order p modulo gq, and so p divides q — 1. 
Hint: Fermat’s theorem. 
(b) Prove that p divides (q — 1)/2, and so 


q=1 (mod 2p) 
and 
2°9-)/2 =1 (mod q). 
Hint: Both p and q are odd. 


(c) Prove that (2) =1,andsoqg=+1 (mod 8). 


10. For every positive integer n, define the Fermat number 
Fy, = 27°41, 
A prime number of the form F;, is called a Fermat prime (see Exer- 
cise 7 in Section 1.5). 
Let n > 2, and let gq be a prime divisor of Fy. 
(a) Prove that 2 has order 2”*! modulo gq. 


Hint: Exercise 8 in Section 2.5. 


(b) Prove that 
q=1 (mod 2”**), 


fe) 
YN 


Prove that there exists an integer a such that 


gn St (mod q). 


Hint: Observe that (2) =1,andso2=a? (mod q). 


(d) Prove that 
q=1 (mod 2”*?), 
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11. 


12. 


13. 


14. 


3. Primitive Roots and Quadratic Reciprocity 


Remark. By Exercise 7 in Section 1.5, the Fermat number Fs is di- 
visible by the prime 641, and 641=1 (mod 2°). 


A binary quadratic form is a polynomial 
f(x,y) = ax? + bay + cy”, where a,b,c are integers. 
The discriminant of this form is the integer d = b? — 4ac. Show that 


4af (x,y) = (Qax + by)? — dy’. 


Let p be an odd prime, and let f(x,y) = ax? + bry + cy? be a 
binary quadratic form with a # 0 (mod p). We say that f(z, y) 
has a nontrivial solution modulo p if there exist integers x and y 
not both divisible by p such that f(z,y) =0 (mod p). Prove that 
f(x,y) has a nontrivial solution modulo p if and only if either d = 0 
(mod p) or d is a quadratic residue modulo p. 


Prove that the binary quadratic form 
f(x,y) = 2a? — 15ay + 27y’ 


has a nontrivial solution modulo p for all primes p. Find a nontrivial 
solution of the congruence 


f(x,y) =0 (mod 11). 


Let p and q be distinct odd prime numbers. Prove that 


SS (2—*1) =1 (mod q), 


ay+---+aq=q (mod p) P 
l<aj<p-1 


where the sum is over all ordered g-tuples of integers (a1,...,2q) such 
that 71 +---+a%,=q (mod p) andl<a;<p-—1fori=1,...,¢. 


Hint: If qx = q (mod p), then « = 1 (mod p). If the qtuple 
(%1,...,%q) contains k distinct integers y1,..., yx such that integer 
yj appears u; times in the g-tuple, so that es ujyj =@ (mod p) 
and ee uj = q, then the number of permutations of this g-tuple is 


the multinomial coefficient (axa): Show that 


(=) =0 (mod q). 
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Let p and q be distinct odd primes. If g is a quadratic residue modulo p, 
then the congruence 
x? =q (mod p) 


is solvable. Similarly, if p is a quadratic residue modulo gq, then the congru- 
ence 
x? =p (mod q) 


is solvable. There is no obvious connection between these two congruences. 
One of the great discoveries of eighteenth-century mathematics is that there 
is, in fact, a subtle and powerful relation between them that depends only 
on the congruence classes of the primes p and g modulo 4. This is expressed 
in Gauss’s celebrated law of quadratic reciprocity. 


Theorem 3.17 (Quadratic reciprocity) Let p and q be distinct odd primes. 
Ifp=1 (mod 4) org=1 (mod 4), then p is a quadratic residue modulo 

q if and only if q is a quadratic residue modulo p. Ifp =q=3 (mod 4), 
then p is a quadratic residue modulo q if and only if q is a quadratic non- 
residue modulo p. Equivalently, 


(2) (2) cae 


S102 cos (p= Dp} 


Proof. Let 


and 
T = {1,2,...,(q—1)/2}. 


Then S is a Gaussian set for the prime p, and T is a Gaussian set for the 
prime q. Let 
SxT={(s,t):seESteT}. 


This is a rectangle of lattice points in R? of cardinality 


p-lq-1 


T|= 
|S x T| 5 5 


We shall count the number m of lattice points (s,¢) in this rectangle that 
lie in the strip defined by the inequality 


=i 
1<pt—qs< ">. (3.11) 


(To understand this proof, it is helpful to choose small primes, for example, 
p = 17, ¢ = 18, and draw pictures of the rectangle S x T and the regions 
defined by inequalities.) 
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If s € S, ti,t2 € T, and the lattice points (s,t,) and (s,t2) both sat- 
isfy (3.11), then 


p-l1 
p|t: — t2| = |(pt: — qs) — (pte — gs)| < ogy 


and so ty = tg. It follows that for every s € S there exists at most one 
t € T that satisfies (3.11). If this inequality holds for some t € T, then 
pt —qs=s' € S, and 


/ 


qs =-—s’ (mod p). 
Using the notation in Gauss’s lemma (Theorem 3.15), we have u,(s) = s’ 


and ¢,(s) = —1. 
Conversely, if s € S and e,(s) = —1, then 


qs = —Uq(s) (mod p), 


and there exists an integer ¢t such that 


Since 


it follows that 
(¢+1)(p— 1) ott 


1<t< 
eee 2p 2 


The prime q is odd, and so 


i<pet—. 
Cee 22 
Therefore, t € T, and the lattice point (s,t) € S x T satisfies inequal- 
ity (3.11). Thus, the number m of lattice points (s,t) € S x T that satisfy 
inequality (3.11) is equal to the number of s € S such that ¢,(s) = —1. By 
Gauss’s lemma, 


Similarly, 


where n is the number of lattice points (s,t) € S x T such that 


—1 
1 <qs—pt <<. 
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or, equivalently, 


1 
<pt—qs<-l. (3.12) 


Since pt — qs £0 for alls € S andt € T, it follows that 


(2) (2)-am 


where m+ n is the number of lattice points (s,t) € S x T such that 


=1 —1 
Z < pt qs so 


(3.13) 


Let M denote the number of lattice points (s,t) € S x T such that 


p-l 
t — 2 
Pp qs 2 


and let N denote the number of lattice points (s,t) € S x T such that 
qd 
t— qs << ———. 
pl — qs 5) 
Then 


p-1q-1 
a ae 


We define a map from the set S x T to itself by reflection: 


m+n+M+N=(|SxT|= 


(s,t) > (s',), 


where 

Pe see 

2 
and 
1 

f= -t. 
This map is a bijection, since 

ae eee 

5 = 
and 
1 

tat 

If (s,t) ¢ Sx T and 
pt—qs > 2, 
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then (s’,t’) ¢ S x T and 


1 1 
pt'—qs' = (4 7 (8 :) 


I 

| 
3 
= 

| 
+ 
Q 
w 


I 
S 
Se 
a 
& 
+ 


< 


Therefore, M < N. Similarly, if (s,¢) € S x T and 


cael 
fase: 
pt — qs 5 


then (s’,t’) ¢ S x T and 


and so M > N. Therefore, M = N and 


(0G) ~ core -taren 


Bat got 


& (es = (—1) 3 zr, 


This completes the proof. 


The quadratic reciprocity law provides an effective method to calculate 
the value of the Legendre symbol. For example, since 7 = 59 =3 (mod 4) 


and 59=3 (mod 7), we have 
59\ 3 
a 7 


(=) 


| 
as 
wil Nn 
Sy 

lI 
oN 
wl 
Sy 


Similarly, since 51 = 3-17 and 97=17=1 (mod 4), we have 
51 3 sg eee acl we eo 
97 97) \97/ \3 17 
Ly. (121 12 
BP RIT NAG 


I 


I 
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7 (=) (7) = @ 


Quadratic reciprocity also allows us to determine all primes p for which a 
given integer a is a quadratic residue. Here are some examples. If a = 5, 


© eae 


Let a=7.Ifp=1 (mod 4), then 


1,4 (mod 5), 
2,3 (mod 5). 


sS'3 


7 = (2) = 1 ifp=1,2,4 (mod 7), 
p) \7/ | -1 ifp=3,5,6 (mod 7). 


Ifp=83 (mod 4), then 


Equivalently, 


7\ _f 1 ifp=1,3,9,19,25,27 (mod 28), 
p) | -1. ifp=5,11,13,15,17,23 (mod 28). 


(=) 


p=1,4 (mod 5) and p=1,3,9,19,25,27 (mod 28) 


Let a = 35. Then 


if and only if 


or 


p=2,3 (mod5) and p=5,11,13,15,17,23 (mod 28). 


This is equivalent to a set of congruence classes modulo 140. 


Exercises 


1. Let p = 11 and q = 7. Using the notation in the proof of the law 
of quadratic reciprocity (Theorem 3.17), we have m+n+M+N= 
|S x T| = 15. Compute the numbers m,n, M, and N. Check that 


(3) = (-))™ and (+) = (-1)". 


2. Use quadratic reciprocity to compute (4). Find an integer x such 
that 2? =7 (mod 48). 
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10. 


11. 
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Use quadratic reciprocity to compute (+3) . Find an integer x such 
that z?=19 (mod 101). 


. Prove that the congruence 


(x? — 2)(x? — 17)(x? — 34) =0 (mod p) 
has a solution for every prime number p. 


Use quaratic reciprocity to find all primes p for which —2 is a quadratic 
residue. 


Use quaratic reciprocity to find all primes p for which 3 is a quadratic 
residue. 


Find all primes for which —3 is a quadratic residue. 
Find all primes for which 5 is a quadratic residue. 
Find all primes for which —5 is a quadratic residue. 


Find all primes p for which the binary quadratic form f(x, y) = 2? + 
xy +y" has a nontrivial solution modulo p. 


Hint: Apply Exercise 11 in Section 3.4. 


In Exercises 11-17 we derive properties of the Jacobi symbol, which 
is a generalization of the Legendre symbol to composite moduli. Let 
m be an odd positive integer, and let 


ra 
ky 
m=T]o 
w=1 


be the factorization of m into the product of powers of distinct prime 
numbers. For any nonzero integer a, we define the Jacobi symbol ( ) 


as follows: “ 
= whe: i 
m) Pi ; 


i=l 


(a) Prove that ifa@=6 (mod ™m), then 


(b) For any integers a and 8, prove that 


Gaole. 


12. 
13. 


14. 


15. 


16. 


17. 
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(c) Prove that (+) = 0 if and only if (a,m) > 1. 


a 
m 


Compute the Jacobi symbol (35). 


Let m be an odd positive integer, and let (a,m) = 1. The integer a is 
called a quadratic residue modulo m if there exists an integer x such 
that 
x? =a (mod m) 

and a quadratic nonresidue modulo m if this congruence has no solu- 
tion. Prove that if (+) = —1, then ais a quadratic nonresidue modulo 
m. Prove that a is not necessarily a quadratic residue modulo m if 
Ce 


Hint: Consider m = 21 and a= —1. 
Let m = p*, where p is an odd prime and k > 1. Prove that 


m—-1_ k(p-1) 
cs 5 (mod 2). 


Hint: Use the binomial theorem to expand m = ((p—1) + 1)*. 


Let m be an odd positive integer with standard factorization m = 
TLL, pt. Prove that 


a 


m—1 = ki(p; — 1) 
= 2). 
5 dX 5 (mod 2) 
Hint: Use induction on r. 


Prove that 
eds = ie ee, 
m 


Let m be an odd positive integer with standard factorization m = 
TI, ph. Prove that 


and 


Let m and n be relatively prime odd positive integers with standard 


factorizations 
‘s 
ki 
m=] 
i=l 
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and ; 
t5 
n=|[ <a’. 
j=l 
Prove that 
m= T. ae m= fq —1 
seca, = ib; i 
> 2 mes (2S) (44) tna 2 
t=1 g=1 
and 


3.6 Quadratic Residues to Composite Moduli 


Let m be an odd positive integer and a an integer relatively prime to m. 
We shall prove that a is a quadratic residue modulo m if and only if a is a 
quadratic residue modulo p for every prime p that divides m. The Chinese 
remainder theorem (see Theorem 2.11) implies that it suffices to consider 
congruences modulo prime powers. 

We begin with Hensel’s lemma, an important result that gives a sufficient 
condition that a polynomial congruence solvable modulo a prime p will also 
be solvable modulo p* for every positive integer k. 

Let 

f(a) Sana" + ane" 1 +--+ ae + a9 
be a polynomial with coefficients in a ring R. The derivative of f(x) is the 
polynomial 


f'(a) = naga”) + (n= Vania”? + +--+ a. 
If f(a) is a polynomial of degree n > 1 with coefficients in the ring Z, 
then the derivative f’(x) has degree n — 1 and leading coefficient na,,. For 
example, if f(z) = 2° — 5a +1, then f’(x) = 3x? — 5. Moreover, 
fla+h) = (@+h)?-—5(e+h) +1 

= (#9 +327h+ 32h? +h?) —(5¢+5h) +1 
(x? — 5a +1) + (32? —5)h+ (82 + h)h? 

= f(x) + f'@)h+ r(x, hyh?, 
where r(x, h) = 3a + h. 


l 


Theorem 3.18 Let R be a ring and f(x) = >}, aix* a polynomial with 
coefficients in R. Then 


f(a th) = f(x) + fi(@)h+r(@, Ah’. 


where r(x,h) is a polynomial in the two variables x and h with coefficients 
in R. 
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Proof. This is a standard calculation. Expanding f(x +h) by the bino- 
mial theorem, we obtain 


f(x+h) = a(x +h)’ 

i=0 

= Sas (ee 
70: - j=0 

EE (Jem 
j=0 ij 

= Sia a,x 4+ Soin C mtd (; a Ips 
i=0 j=2 i=j 


= f(x) + fi(e)htr(a, hh’, 


n 
=> (jah in 
jH2 i=j 
is a polynomial in x and h with coefficients in R. 


where 


Theorem 3.19 (Hensel’s lemma) Let p be prime, and let f(x) be a 
polynomial of degree n with integer coefficients and leading coefficient not 
divisible by p. If there exists an integer x1 such that 


f(x1) =0 (mod p) 
and 

f'(x1) #0 (mod p), 
then for every k > 2 there exists an integer x, such that 

f(ar) =0 (mod p*) (3.14) 
and 


tp =p (mod p*-!), (3.15) 


Proof. The proof is by induction on k. We begin by constructing x2. 
There exist integers uy and v; such that f(a1) = uip and f’(x1) = v1 #0 
(mod p). We shall prove that there exists an integer y,; such that f(a, + 
yip) =0 (mod p*). 

By Theorem 3.18, there exists a polynomial r(x, h) with integer coeffi- 
cients such that 


f(tit+yp) = f(a) + f'(ei)yip +r (21, yp)” 


= uptuyptr(xi,yip)p" 
= wptuiyip (mod p’). 
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Therefore, there exists an integer y1 such that 
f(t1+y1p) =0 (mod p?) 
if and only if the linear congruence 
viy = —u (mod p) 


is solvable. We see that this congruence does have a solution y; because 
(v1, p) = 1. Let 


Ly = X1 + Yip. 
Then 
f(a2)=0 (mod p) and w#2=2, (mod p). 
Let k > 3, and assume that we have constructed integers r2,..., 2-1 
such that 
f(a;:)=0 (mod p’) and 2;=2;-1 (mod p’') 
for i= 2,...,k—1. There exists an integer uz_1 such that 


f(ze—1) = Up—rp**. 


Let f’(@p—-1) = Up—1. Since vp-1 = 21 (mod p), it follows that 
Ue-1 = f"(te-1) = f'(@1) #0 (mod p). 
Applying Theorem 3.18 with t = 2,_1 and h = y,_1p*~!, we obtain 
f (we—-1 + ye-1p**) 


f (ep—1) +f! (@e-1)Ye—1P” | + 1 (@R-1, Yee” )yR_ Pr 


= up-1p* | + ve-1ye—-1p"' (mod p*). 


2 


It follows that 
a (ax 1+ Yk ip**) =0 (mod p*) 


if and only if there exists an integer yz—1 such that 


Uk—-1Yk—-1 = —Up—1 (mod p). 


This last congruence is solvable, since (vz_1,p) = 1, and the integer 7, = 
Tp-1 + Ye-1p"—' satisfies conditions (3.14) and (3.15). 


Theorem 3.20 Let p be an odd prime, and let a be an integer not divisible 
by p. If a is a quadratic residue modulo p, then a is a quadratic residue 
modulo p* for every k > 1. 
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Proof. Consider the polynomial f(a) = x? —a and its derivative f’(x) = 
2x. If a is a quadratic residue modulo p, then there exists an integer 7, such 
that 2; #0 (mod p) and x} =a (mod p). Then f(z1) =0 (mod p) 
and f’(z,) #0 (mod p). By Hensel’s lemma, the polynomial congruence 
f(z) =0 (mod p*) is solvable for every k > 1, and so a is a quadratic 
residue modulo p* for every k > 1. 


Exercises 


1. Let 2, = 3. Costruct integers x, such that 7? = 2 (mod 7*) and 
L_=Atp-1 (mod 7*-+) for k= 2,3,4. 


2. Let p be a prime, p ¥ 3, and let a be an integer not divisible by p. 
Prove that if a is a cubic residue modulo p, then a is a cubic residue 
modulo p* for every k > 1. 


3. Denote the derivative of the polynomial f(x) by D(f)(x) = f’(x). 
We define 


DOH. = Ja), 
DY(P(e) = D(D* Nf) (@) for k>1, 


The polynomial D“)(f) is called the kth derivative of f. Prove that 
if f(x) is a polynomial with integer coefficients, then D”)(f)(x) = 0 
if and only if the degree of f(x) is at most k — 1. 


4. Let f(a) and g(x) be polynomials. Prove the Leibniz formula 
Df + 9)(x) = F(a) - D(g)(a) + DF)(@) - g(@). 
5. Let f(x) be a polynomial of degree n. Prove Taylor’s formula 
DNS) os 
f(cth= >> ao 


k=0 


6. This exercise generalizes Hensel’s lemma (Theorem 3.19). Let p be a 
prime, and f(x) a polynomial of degree n with integer coefficients and 
leading coefficient not divisible by p. Let @ be a nonnegative integer. 
If there exists an integer x; such that 


f(a) = 
oa! = 


(mod p”™*"), 


0 
0 (mod p‘), 
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and 
f'(a1) #0 (mod p*"), 


then for every k > 2 there exists an integer xz, such that 
f(x~) =0 (mod gtr) 


and 
L_ =Xp-1 (mod prey, 
Hint: Prove by induction on k. To begin the induction, find an integer 
y, such that f(a; + yp*t?) = 0 (mod p*4+?) and let v2 = 21, + 
+1 
Yip. 


3.7 Notes 


Primitive roots and quadratic reciprocity are classical topics in number 
theory and a standard part of an introductory course in the subject. 

There are still many simple questions about primitive roots that we can- 
not answer. For example, we cannot determine the prime numbers for which 
2 is a primitive root. We do not even know if the number of such primes is 
finite or infinite. Gauss conjectured that 10 is a primitive root for infinitely 
many primes. This would imply, by Exercise 9 in Section 2.5, there are 
infinitely many primes p such that the decimal expansion of the fraction 
1/p has period p—1. We do not, in fact, know even one integer that is a 
primitive root for infinitely many primes. There is an amazing result due to 
Gupta and Murty [44] and Heath-Brown [62] that states that every prime 
number, with at most two exceptions, is a primitive root for infinitely many 
primes. It follows that at least one of the numbers 2,3, and 5 is a primitive 
root for infinitely many primes, but we do not know which one. 

Let a be an integer that is not a square and a # —1. A conjecture of 
Artin [5, page viii] states that there exist infinitely many primes for which 
a is a primitive root. Moreover, Artin has a conjectured density for the set 
of primes for which a is a primitive root. Murty [98] is a nice survey paper 
of Artin’s conjecture and its generalizations. Erdés asked the following: For 
every sufficiently large prime p, does there exist a prime gq < p such that q 
is a primitive root modulo p? 
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Fourier Analysis on Finite Abelian 
Groups 


4.1 The Structure of Finite Abelian Groups 


This chapter introduces analysis on finite abelian groups and their char- 
acters. We begin by using elementary number theory to determine the 
structure of finite abelian groups. 

Let G be an abelian group, written additively, and let A,,...,A, be 
subsets of G. The sum of these sets is the set 


Ay +--+: +Ap = {ar +--+ 4%: a; € A; fori=1,...,k}. 


If G,,...,G, are subgroups of G, then the sumset G; +---+ Gy, is a 
subgroup of G (Exercise 2). We say that G is the direct sum of the subgroups 
Gi,...,Gx, written G = G1 6---@Gx, if every element g € G can be written 
uniquely in the form g = g1 +---+ 9x, where g; € G; fori =1,...,k. If 
G=G,@---®Gx, then |G] = |G,|---|G,| (Exercise 3). 

The order of an element g in an additive group is the smallest positive 
integer d such that dg = 0. By Theorem 2.16, the order of an element of a 
finite group divides the order of the group. 

Let p be a prime number. A p-group is a group each of whose elements 
has an order that is a power of p. For every prime number p, let G(p) 
denote the set of all elements of G whose order is a power of p. Then G(p) 
is a subgroup of the abelian group G (Exercise 6). 


Theorem 4.1 Let G be a finite abelian group, written additively, and let 
|G| = m. For every prime number p, let G(p) be the set of all elements 
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g € G whose order is a power of p. Then 


G= Qc). 
p|m 


Proof. Let m = nee p;' be the standard factorization of m, and let 
m, = mp," fori =1,...,k. Then (m1,...,mx) = 1 by Exercise 15 in 
Section 1.4, and so there exist integers u1,...,uxz such that 


myuy +--+ + mpup = 1. 


Let g € G, and define g; = miuig € G for i = 1,...,k. Since p;'g; = 
mujg = 0, it follows that g; € G(p). Moreover, 


g = (miu +--+ + meue)g = mug +--+ + mMpurg 
= git---+gn € G(p1) +--- + G(pe), 


and so 
G = G(pi) +--+ + G(pr). 


Suppose that 
Qt +o, =9, 


where g; € G(p;) fori = 1,...,k. There exist nonnegative integers r1,..., 1k 
such that g; has order p;* fori =1,...,k. Let 


k 
d; = || vi. 
mi 
If g; A 0, then djg; 4 0. Since djg; = 0 fori =1,...,k, i A Jj, it follows 
that 
0 = dj(g1 +--+ + gr) = 4595, 
and so g; = 0 for all j = 1,...,k. Thus, 0 has no nontrivial representation 


in G = G(pi)+---+G(pz). By Exercise 4, we conclude that G is the direct 
sum of the subgroups G(p;). 


Lemma 4.1 Let G be a finite abelian p-group. Let g, € G be an element 
of maximum order p™, and let Gy = (g,) be the cyclic subgroup generated 
by gi. Consider the quotient group G/G,. Leth € G. Ifh+G, € G/G, 
has order p’, then there exists an element g € G such that g+G, =h+G, 
and g has order p” in G. 


Proof. If h + G; has order p" in G/G}, then the order of h in G is 
at most p"™ (since p™ is the maximum order in G) and at least p” (by 


4.1 The Structure of Finite Abelian Groups 123 


Exercise 7). Since G, = p"(h+ G1) = p"h4+ Gi, it follows that p"h € Gi, 
and so p"h = ug; for some positive integer u < p™ (since gi has order p"). 
Write u = p*v, where (p,v) = 1 and 0 < s < r;. Then vg; also has order 
p™, and so p*vg, has order p™!~* in G. Then p’h = p*vg; has order p™~* 
in G, and so h has order p™*"~* < p™. It follows that r < s, and 


Ss 


p’h = p*vg; = p" (p*"vgi) = pg}, 


where 
7h =P "vn € Gy 
Let 
g=h—g. 
Then 


gt+G,=h+G\. 


Moreover, p’g = p"h — p’gi = 0, and so the order of g is at most p”. On 
the other hand, g + G; has order p” in the quotient group G/G,, and so 
the order of g is at least p". Therefore, g has order p”. 


Theorem 4.2 Every finite abelian p-group is a direct sum of cyclic groups. 


Proof. The proof is by induction on the cardinality of G. Let G be a 
finite abelian p-group. If G is cyclic, we are done. If G is not cyclic, let 
gi: € G be an element of maximum order p”, and let G; be the cyclic 
subgroup generated by g;. The quotient group G/G; is a finite abelian 


p-group, and 
G 
1 < |G/G,| = 1G] < |GI. 
De 


Therefore, the induction hypothesis holds for G/G, and so 
G/G, = H2®---@ Akg, 


where H; is a cyclic subgroup of G/G, of order p" for 1 = 2,...,k. More- 
over, 


Ia| : 
a IG/G,| = Il» i 
1=2 
By Lemma 4.1, for each 1 = 2,...,k there exists an element g; € G such 


that g; + G, generates H; and g; has order p”™ in G. Let G; be the cyclic 
subgroup of G generated by g;. Then |G;| = p" for i = 1,...,k. We shall 
prove that G=G,@---@®Gz. 

We begin by showing that G = G, + --- + Gx. If g € G, then g+ Gy € 
G/G,, and there exist integers u2,...,u,% such that 


O<uj<p*—-1 fori=2,...,k 
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and 
g + Gy = uU2(g2+ Gi) ®--- Bunge + Gi) = (age +--+ + Uege) + G1. 


It follows that 
g — (U2g2 +--+ + Urge) = gi € Gy 


for some integer wu, such that 
0 Sip a1, 
and so 
g = U191 + Uggg +--+ + Ung, € Gr +--+ + Ge. 


Therefore, G= G, + ---+ Gx. Since 


k 
|G] =|Gi +--+ + Gel < |Gil---|Gk| = [] o" = IG, 


i=l 


it follows that every element of G has a unique representation as an element 
in the sumset G, +---+G,;, andsoG=G,@+---+@G,. This completes 
the proof. 


Theorem 4.3 Every finite abelian group is a direct sum of cyclic groups. 


Proof. This follows immediately from Theorem 4.1 and Theorem 4.2. 


Let G1,..., Gz, be abelian groups, written additively. Their direct product 
is the group 


Gy x xX Ge = {(91,---5 9k) gi € Gi fori=1,...,k}, 
with addition defined by 


(Gis +++ 19k) + (Gis +++ 9k) = (91 + G1s+-++ 9k + Ik): 


If G,,...,G , are subgroups of an abelian group G and if G = G,|@---®Gz, 
then G&G, x --- x G, (Exercise 5). 

Let G1,...,G, be abelian groups, written multiplicatively. Their direct 
product is the group G, x --- x Gx consisting of all k-tuples (g1,..., 9x) 
with g; € G; fori =1,...,& and multiplication defined coordinate-wise by 


(915 +++: 9k) (G15-+ +99) = (9191>+- +5 GEG): 
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Exercises 


1. Let G = Z/12Z be the additive group of congruence classes modulo 
12. Compute G(2) and G(3) and show explicitly that G(2) = Z/4Z, 
G(3) & Z/3Z, and 


Z/12Z ~ Z/4Z © Z/3Z. 


2. Let G be an abelian group, written additively, and let G1,...,G, be 
subgroups of G. Prove that G, +---+G, is a subgroup of G. 


3. Let G be an abelian group, written additively, and let Gj,...,G 
be subgroups of G such that G = Gi, + --- + Gx. Prove that |G| 
|G,|---|Gx|. Prove that G = G, ®--- © G, if and only if |G| 
IGil---|Gxl. 


IL lA = 


4. Let G be an abelian group, written additively, and let G1,...,Gy 
be subgroups of G such that G = G, +--- + Gx. Prove that G = 
G, ®---@Gz, if and only if the only representation of 0 in the form 
O=g +--+ 9x with g € Gi is gg. =---= ge =0. 


5. Let Gi,...,G_ be subgroups of an abelian group G such that G = 
G, ®-:-® Gy. Prove that G&G, x +--+ x Gr. 


6. Let G be an additive abelian group. For every prime number p, let 
G(p) denote the set of all elements of G whose order is a power of p. 
Prove that G(p) is a subgroup of G. 


7. Let f :G— H bea group homomorphism, and let g € G. Prove that 
the order of f(g) in H divides the order of g in G. Prove that if G is 
a p-group and f is surjective, then H is a p-group. 


8. Let G be a finite abelian p-group. If r1,...,r~ are positive integers 
with r; > --- > rz, then we say that G is of type (p™,...,p™*) 
if G S G, @--- ® Gg, where G; is a cyclic group of order p”™ for 
i =1,...,k. We shall prove that every finite abelian p-group has a 
unique type. 


Let pG = {pg:g € G}. 


(a) Prove that pG is a subgroup of G. 
(b) Prove that if G is of type (p"!,...,p"*) with r; > 2 and rj41 = 
-+- rp =1, then pG is of type (p"™~1,...,p"7~"). 
(c) Prove that 
|G] = p*|pGl. 


(d) Prove that if G is of type (p™,...,p"*) and also of type (p*,..., p%*), 
then k = £. 
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(e) Prove that if the finite abelian p-group G is of type (p"™,...,p"*) 
and of type (p*!,...,p**), then r; = s; for i=1,...,k. 
Hint: Use induction on the cardinality of G. Let j and @ be 
the greatest integers such that r; > 2 and sg > 2, respectively. 
Apply the induction hypothesis to pG to show that 7 = @ and 
r; = 8; fori=1,...,7. 


4.2 Characters of Finite Abelian Groups 


Let G be a finite abelian group, written additively. A group character is 
a homomorphism x : G > C%*, where C% is the multiplicative group of 
nonzero complex numbers. Then x(0) = 1 and x(gi + g2) = x(91)x(g2) for 
all 91,92 € G. 

If y is a character of a multiplicative group G, then y(1) = 1 and 
x(9192) = x(91)x(92) for all gi, g2 € G. 

We define the character yo on G by yo(g) = 1 for all g € G. 

If G is an additive group of order n and if g € G has order d, then 


x(9)* = x(dg) = x(0) = 1, 


and so x(g) is a dth root of unity. By Theorem 2.16, d divides n and y(g) 
is an nth root of unity for every g € G. We have |x(g)| = 1 for all g € G. 
We define the product of two characters y; and x2 by 


x1xX2(9) = x1(g)x2(9) 


for all g € G. This product is associative and commutative. The character 
Xo is a multiplicative identity, since 


xox(9) = xo(g)x(g9) = x(g) 


for every character y and g € G. 
The inverse of the character y is the character y~! defined by 


since 
xx (9) = =x(9)x~"(9) = x(9)x(-9) 
= x(g-g9)=x(0)=1 
= xo(g), 


and so xx~! = xo. 


The complex conjugate of a character x is the character ¥ defined by 
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Since |x(g)| = 1 for all g € G, we have 


(xx)(9) = x(9)xX)(9) = Ix(g)? = 1 = xolg), 


and so 


for every character y and all g € G. 

It follows that the set of all characters of a finite abelian group G is an 
abelian group, called the dual group or character group of G, and denoted 
by G. We shall prove that G = G for every finite abelian group G. We 
begin with finite cyclic groups. 


Lemma 4.2 The dual of a cyclic group of order n is also a cyclic group 
of order n. 


Proof. We introduce the exponential functions 
e(z) = e2ma 


and 


€n(x) = e(a/n) = e?*7/™. 


The nth roots of unity are the complex numbers e,, (a) for a = 0,1,...,n—1. 
Let G be a finite cyclic group of order n with generator go. Then G = 
{jgo:j =90,1,...,n —1}. For every integer a, we define Ww, € G by 


Ya(Jgo) = en(aj). (4.1) 


By Exercise 3, we have Wath = Vatb, Vat = Va, Va = Wp if and only if 
a=b (mod n). It follows that 


Wa = ut 
for every integer a. If x is a character in G, then x is completely determined 
by its value on go. Since (go) is an nth root of unity, we have y(go) = en(a) 
for some integer a = 0,1,...,2—1, and so y(jgo) = en(aJ) for every integer 


j. Therefore, y¥ = Wa and 


G = {va:a=0,1,...,n—1} = {b%:a=0,1,...,n—1} 


is also a cyclic group of order n, that is, G = c 


It is a simple but critical observation that if g is a nonzero element of a 
cyclic group G, then (gg) 4 1 (Exercise 4). 
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Lemma 4.3 Let G be a finite abelian group and let Gi,...,Gx be sub- 
groups of G such that G= G1 @---@Gz. For every character x € G there 
exist unique characters y; € G; such that if g © G and g = 91 +-::+ 9k 
with g; © G; fori=1,...,k, then 


x(9) = x1(91) ++ XK (9K)- (4.2) 
Moreover, aso re 
GG, x-:--x Gr. 

Proof. If x; € G. fori =1,...,k, then we can construct a map xy: G > 


C™% as follows. Let g € G. There exist unique elements g; € G; such that 
g=ot+--:+ 9x. Define 


x(g) = x(91 +++ + 9%) = X1(91) +++ XK CGe): 
Then y is a character in 6. and this construction induces a map 
VC Merwe, Se (4.3) 


By Exercise 5, the map W is a one-to-one homomorphism. We shall show 
that the map W is onto. Let x € G. We define the function .; on G; by 


xi(gi) = x(gi) for all g; € Gj. 


Then y; is a character in G. Ifg€Gandg=git+-:--+ 9% with g € Gi, 
then 


x(9) = x(g1 +-++ + on) = x(g1) +++ x(9e) = x1 (91) +> xe (Ge): 


It follows that 
W(xa,--->Xk) =X; 


and so W is onto. 


Theorem 4.4 Let G be a finite abelian group. If g is a nonzero element 
of G, then there is a character x € G such that x(g) 4 1. 


Proof. We write G = G1 @---@G;, as a direct product of cyclic groups. 
If g £0, then there exist g; € Gi,..-, 9% € Gy such that g = g. +---4+ 9k, 
and g; # 0 for some j. Since the group G; is cyclic, there is a character 
xj E G; such that y;(g;) #1. Forti =1,...,k,i Aj, let x; € G; be the 
character defined by x;(gi) = 1 for all g; € G;. If xy = U(y1,.--, xk) € G; 
then x(g) = xj(9;) #1. 
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Theorem 4.5 A finite abelian group G is isomorphic to its dual, that is, 
G2e 
Proof. By Lemma 4.2, the dual of a finite cyclic group of order n is also 
a finite cyclic group of order n. By Theorem 4.3, a finite abelian group G 
has cyclic subgroups Gj,...,G, such that 
G=Gi0-::-@G. 


By Lemma 4.3 and Exercise 5 in Section 4.1, 


CLG ie GO, SE eee GLE GG OSG. a 


This completes the proof. 


Let G be a finite abelian group of order n. There is a pairing ( , ) from 
G x G into the group of nth roots of unity defined by 
(a,x) = x(a). 


This map is nondegenerate in the sense that (a, ) = 1 for all group elements 
a € Gif and only if y = yo, and (a,x) = 1 for all characters y € G if and 
only if a = 0 (by Theorem 4.4). 

For each a € G, the function (a, ) is a character of the dual group G, that 


is, (a, ) € G. The map A: G => G defined by at— (a, ) or, equivalently, 
A(a)(x) = (a,x) = x(a), (4.4) 


is a homomorphism of the group G into its double dual G. Since the pairing 
is nondegenerate, this homomorphism is one-to-one. Since |G] = IG| = IG, 


it follows that A is a natural isomorphism of G onto Gc 


Theorem 4.6 (Orthogonality relations) Let G be a finite abelian group 
of order n, and let G be its dual group. If x € G, then 


_J ” #x=Xo, 
Ex@={ if X # Xo- 
IfaeG, then 

if a =0, 
¥ xia) ={ 0 a 
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Proof. For x € G, let 
S(x) = >) x(a). 
aeG 
If x = xo, then S(xo) = |G| = n. If x xo, then x(b) 4 1 for some b € G, 


and 


x(b)S(x) = x(b) S> x(a) 


and so S(yv) = 0. 
For a € G, let 


T(a) = >> x(a). 
xeG 
If a = 0, then T(a) = |G| = n. If a £0, then y/(a) # 1 for some y’ € G 
(by Theorem 4.4), and 


V'(@)T(a) = x(a) d) x(a) 


= S>x'x(a) 


xXEG 


=> a) 


xeG 
= T(a), 


and so T(a) = 0. This completes the proof. 


Theorem 4.7 (Orthogonality relations) Let G be a finite abelian group 
of order n, and let G be its dual group. If x1, x2 € G, then 


sip, 2 A AE Ny 
S> x1(a)Xa(@) = { 0 ifs # a 


If a,b © G, then 
_ n ifa=b, 
Exoxm={ 5 Heo} 
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Proof. These identities follow immediately from Theorem 4.6, since 


x1(a)x2(@) = x1x3"' (a) 


and 
x(a)x(b) = x(a — 8). 


This completes the proof. 


The character table for a group has one column for each element of the 
group and one row for each character of the group. For example, if C4 is 
the cyclic group of order 4 with generator go, then the characters of C4 are 
the functions 

Va(J90) = eaaj) = i 


for a = 0,1,2,3, and the character table is the following. 


0 | go | 290 | 390 
Mela lt als 
wy | 1 a} -1 | -1 
gf = On 
w3)1} —7} -1 a 


Note the that sum of the numbers in the first row is equal to the order 
of the group, and the sum of the numbers in each of the other rows is 0. 
Similarly, the sum of the numbers in the first column is the order of the 
group, and the sum of the numbers in each of the other columns is 0. This 
is a special case of the orthogonality relations. 


Exercises 
1. Let C2 be the cyclic group of order 2. 


(a) Compute the character table for C}. 
(b) Compute the character table for the group C2 x C9. 


2. Compute the character table for the cyclic group of order 6. 


3. Let G be a finite cyclic group of order n. Define the characters qq on 
G by (4.1). Prove that 


(a) Vat, = Vato, 
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(b) Wee = Wa; 
(c) Wa = Vy if and only ifa=b (mod n). 


. Prove that if G is cyclic and g € G,g #0, then v1(g) 4 1. 
. Prove that the map W defined by 4.3 is a one-to-one homomorphism. 


. Consider the map( , ):G~x G — C% defined by 


(9,X) = x(g)- 


Prove that 


(gt+g9',x) =(9,x)(9',x) and (g,xx’) = (9.x) (9, x’) 


for all g'g! € Gand x,y €G. 


. Let G= Z/mZ x Z/mZ. For integers a and b, we define the function 


Wa,p on G by 
Wa,p(a + mZ, y + mZ) = e2™*ar+by)/m — | (ax + by). 


(a) Prove that Wa,» is well-defined. 


(b) Prove that Way = ea if and only ifa=c (mod m) and b=d 
(mod m). 


(c) Prove that qq.) is a character of the group G. 
(d) Prove that G= {Wap : a,b =0,1,...,m— 1}. 


. Let p be a prime number, and let G = (Z/pZ)* be the multiplicative 


group of units in the field Z/pZ. Let g be a primitive root modulo p. 
For every integer a, define the function y, : G — C” as follows: If 
(a,p)=landx=g" (mod p), then 


Xa(x + pZ) = e?™4¥/-)) = @,_s (ay). 


(a) Prove that x, is a character, that is, vq € CG. 
(b) Prove that ya = x» if and only ifa@=b (mod p—1). 
(c) Prove that G = {y,:a=0,1,...,p—2}. 


. Let G be a finite abelian group. For every integer r, let 


G" = {rg:g€G} 


and %, t, 
Gr={x €G:x" = xo}. 


(a) Prove that G” is a subgroup of G and G, isa subgroup of Gi 
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(b) Let d= (r,n). Prove that G" = G4 and G, = Ga. 

(c) Let y € G. Prove that y € G, if and only if y(a) = 1 for all 
aeG’. 

(d) Let y € G,. Define the function x, on the quotient group G/G” 
by 

Xr(a+ G") = x(a). 

Prove that x, is well-defined. Prove that vy, € G/G, and that 
the map from GC. to G/G,. defined by y > y, is a group isomor- 
phism. 


10. Let G be a finite abelian group and G” = {rg: g € G}. Let [G: G"] 
be the index of the subgroup G” in G. Prove that 


s (a) = IG: G7] ifaeG" 
of AN 10 raca, 
xXEGr 


Hint: Consider the quotient group G/G", and note that IG, 
GIG, =(G:G"]. 


4.3 Elementary Fourier Analysis 


Let G be a finite abelian group of order n, and let L?(G) denote the n- 
dimensional vector space of complex-valued functions f on G. The complex 
conjugate of f € L?(G) is the function f € L?(G) defined by 


for alla eG. 
For a € G, we define the function 6, € L?(G) by 


1 ife=a, 
én(e) = { 0 ifaFa. 
If f € L7(G), then 
f= S- f(a)ba, 


acG 


and the set of n functions {6q : a € G} is a basis for the vector space L?(G). 
We define a function y on the subsets of G by 


w(U) = |U| 


for all U C G. Then pu(G) = n, and yu is additive in the sense that, if Uy and 
U2 are disjoint subsets of G, then (U1, UU2) = (U1) + u(U2). The function 
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js is also translation invariant, since u(a + U) = p(U) for all U C G and 
a € G. We call p a Haar measure on the group G.! 
Using the measure j1, we define the integral of f € L?(G) as 


[- [omen Es 


We define an inner product on the space L?(G) by 


(ths fa) = [ah= S> Ala)fl@) 


xrEG 
Then : 
1 ifa=b, 
(6a, 6b) = D3 hale y={ 4 ifaFZ b, 


and so the set of functions {6q : a € G} is an orthonormal basis for L?(G). 
Moreover, for all f € L?(G) and a € G, we have 


ba) =) | # € Gf (2)ba(x) = f(a). 


The L?-norm of a function f € L?(G) defined by 


1/2 
Ifle=(/)'? = (Si r) : 


rEG 


The Cauchy-Schwarz inequality states that 


(fi, fa)| < Wfillall falle (4.5) 


for all functions f,, fo € L?(G) (Exercise 5). 

A character is a complex-valued function on G, and so G C L?(G). We 
shall show that G is also a basis for L?(G). 

If x1, 2 are characters of G, then the orthogonality relations (Theo- 
rem 4.7) imply that 


(x1,X2) = [ox 
G 


S> x1(a)xa(a) 
acG 
= n if x1 =xe 
0 ifviA# x2, 


lWe can also define a measure y on G by p(U) = |U|/n. This has the advantage that 
u(G) = 1, but it is not the traditional choice in elementary number theory. 
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and so the n characters in the dual group G are orthogonal in the vector 
space L?(G). Since |G| = |G] = dimc L?(G) = n, it follows that G is a 
basis for L?(G). 

There are an analogous Haar measure and inner product on the dual 
group G. If f, f2 € L?(G), then 


(A, h)= [,fh= SS AAD. 


xEG 


Let G denote the double dual of G, that is, the group of characters of the 
dual group G. For a € G, we defined A(a) € G by 


and we proved that every character in Gis of the form A(a) for some a € G. 
By the orthogonality relations (Theorem 4.7), for every a,b € G we have 


(Ala), A(b))g = SY) ADAG (Od) 


I 

JM 
rad 
S 
el 
Z 


The Fourier transform is a linear transformation from L?(G) to L?(G) 
that sends the function f € L?(G) to the function f € L?(G), where 


Fx) = (fx) = S5 F)X(9)- (4.6) 
gEG 
For example, the Fourier transform of the function 6, € L?(G) is 


bax) = >> ba(g)X(g) = X(@) = x(—a). 


gEG 


The process of recovering f from its Fourier transform f is called Fourier 
inversion. 


Theorem 4.8 (Fourier inversion) Let G be a finite abelian group of or- 
der n with dual group G. If f € L?(G), then 


f= - S- FOOx (4.7) 


xXEG 
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and (4.7) is the unique representation of f as a linear combination of char- 
acters of G. 


Let A: G > G be the isomorphism defined by A(a)(x) = x(a) for all 


VEG. if Pe L?(G), then fe i (@), and, for every a € G, 


n~ 


f(A(a)) = nf(—a). (4.8) 


Proof. This is a straightforward calculation. Let a € G. Defining the 
Fourier transform by (4.6), we have 


> fooxa) = => (= rox) x(a) 


XEG xXEG 


by the orthogonality relations (Theorem 4.7). This proves (4.7). The unique- 
ness of the series (4.7) is Exercise 2. 
To prove (4.8), we have 


FAM) = SD FOIBOO 


xXxEG 


= SY sxx) 


YEG IEG 


= > #9) do xXg+2) 


geG xeG 


= nf(-a). 


This completes the proof. 


The sum (4.7) is called the Fourier series for the function f. 


Theorem 4.9 (Plancherel’s formula) If G is a finite abelian group of 
order n and f € L?(G), then 


Ilfll2 = Vallflle- 
Proof. We have 


n~ nN 


(Fle Ge) 
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= ni fil. 
This completes the proof. 


Let G be a finite abelian group of order |G| = n, and let f € L?(G). The 
support of f is the set 


supp(f) = {a € G: f(a) # 0}. 
We define the L°°-norm of a function f € L?(G) by 
II flloo = max{|f(a)| : a € G}. 


For every function f € L?(G) we have the elementary inequality 
WF = (A) = > F@P < [fz [supp (/)1. (4.9) 
aeG 


The uncertainty principle in Fourier analysis states that if f € L?(G) is 
a function with Fourier transform f € L?(G), then the sets supp(f) and 


n 


supp(f) cannot be simultaneously small. This has the following quantitative 
formulation. 


Theorem 4.10 (Uncertainty principle) If G is a finite abelian group 
and f € L?(G), f £0, then 


|supp(f)||supp(f)| = |G. 
Proof. Let a € G. By Theorem 4.8, 


1 a 
fla) =— > Fodxta). 
xeG 
Since |x(a)| = 1 for all x € ‘es it follows that 


V@l<= Sd ifol== Yo IFoo 


xeG xEsupp( 


<>) 
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and so 


Ilo <= > [FOOL 
xesupp(f) 
Applying the Cauchy-Schwarz inequality (4.5) with f; = FX) and with fo 
the characteristic function of the set supp f , we have 


2 


S> fool] = SS IFO? lsupp(F)I. 


xesupp(f) xesupp(f) 


Using Plancherel’s formula (Theorem 4.9), and inequality (4.9), we obtain 


2 


Wee < s| S&S Fo 


< a) 
xesupp(f) 

1 pS x 

< = YS [fePlupe(A) 
xesupp(f) 

io Ba 
= S[fislsupp A) 

1 ~ 
= =| f1Blsurr()) 
< =[f\P.\supp(f)|lsupp( A) 


Since f 4 0, we have ||f||.. > 0 and so 


n~ 


|supp(f)||supp(f)| 2 n = |GI. 
This completes the proof. 


If f € L?(G) and |supp(f)| = 1, then the uncertainty principle implies 
that |supp(f)| = |G|, that is, f(x) 4 0 for all x € G. Here is an example. 
Let a € Gand f = 6, € L?(G). Then 6,(x) 4 0 if and only if x = a, and so 
|supp(5.)| = 1. We have 6,(v) = x(a) £ 0 for all y € G. This shows that 
the lower bound in the uncertainty principle is best possible. 


Exercises 
In these exercises, G is a finite abelian group. 


1. Let f,g € L?(G). Prove that 
(9, f) = (f.9)- 


10. 
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. Let f € L?(G). Prove that if c € L2(G) and f = (1/n) yea lx) 


then e(x) = f(x): 


. Prove that the Haar measure on G is unique, that is, there exists 


a unique function y on the subsets of G such that p is additive, 
translation invariant, and u(G) = n. 


. Let U : L?(G) > L?(G) bea linear transformation such that U(5,)(x) = 


X(a) for all x € G. Prove that U is the Fourier transform, that is, 
U(f) =f for all f € L(G). 


. (Cauchy-Schwarz inequality) Let f,g € L?G. Prove that 


If, 9)I < IIfllellglle- 


Hint: If \ € C, then || f — Ag||2 > 0. For g 4 0, apply this inequality 
with A = (f,9)/(9.9)- 


. Prove that if f,g € L(G), then 


If + gll2 < Iflle+ Ilglle- 


. Let x1, x%2 € G. Prove that 


oe fn ifx1=xe2 
X1(x2) = { 0 ifxi14# xX. 


. Use the uncertainty principle to prove that the Fourier transform is 


one-to-one. 
Hint: Prove that if f € L(G) and f £0, then f £0. 


. For a € G and f € L?(G), we define the translation operator T;, on 


L?(G) by Ta(f)(x) = f(x — a). Prove that T,(f) = (a) f. 


For functions fi, f2 € L?(G), we define the convolution fi*f2 € L?(G) 
by 


fis hte) = | hla-2)h(e)de= DO flea) hl. 
ae€G 


(a) Prove that 
fixhla)= > fle 


x+y=a 


(b) Prove that convolution is commutative, that is, 


fi * fo = fo* fi. 
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(c) Prove that convolution is associative, that is, 
(fi * fa) * fa = fi * (fo * fa). 
(d) Prove that, if fi,..., fe € L7(G), then 


freee fela)= So filei)-+ fete): 


G1++-+0_p=4 
11. Let x € G. Prove that 
xerexa)= DP x(a tata t+ + ae). 


12. Let p be a prime number, and define £,, € L?(Z/pZ) by 
a 


fp(a+ pZ) = (=) 


where (=) is the Legendre symbol. Prove that 


—_" Pp 


x x oe “2x 
ieee Bilan: SP (a2). 
k times eae 


13. Let fi, fo,..-, f¢ € L?(G). Prove that a product of Fourier transforms 
is the convolution of the product in the sense that 


A-h=fith 


and 


fi- fore fe = fi 8 fo®® fir 


14. Prove that 6. * f =Tu(f) for all f € L?(G). Use this to give another 
proof of Exercise 9. 


4.4 Poisson Summation 


Let G be a finite abelian group with subgroup H, and let L?(G)" be the 
vector space of complex-valued functions on G that are constant on cosets 
in G/H, that is, 


L?(G)" = {f € L7°(G): f(c@ +h) = f(a) for allz € G andhe H}. 
Let G# be the group of characters of G that are trivial on H, that is, 
GH = {y €G: y(h) =] for all h € H}. 
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Lemma 4.4 Let G be a finite abelian group with subgroup H. Then 
GF =Gn L(G)". 
Proof. If y € G¥ C G, then x(x +h) = y(x)x(h) = x(x) for alle eG 


and h € H, and so x € GN L?(G/H). Conversely, if y € GN L?(G/H), 
then y(h) = x(0+ h) = x(0) = 1 for all h € H, and y € G". 


Lemma 4.5 Let G be a finite abelian group with subgroup H, and let mr : 
G — G/H be the natural map onto the quotient group. For f* € L?(G/H), 
define the map n*(f*) € L?(G) by 


m*(f*)(a) = flr(a) = fi(@+ H) 
for alla € G. Then r* is a vector space isomorphism from L?(G/H) onto 
L?(G)". Moreover, 
at (G/H) CG", 
and the map we 
nr: G/H — GF 
is a group isomorphism. 
Proof. Let f* € L?(G/H). If x € G and h € H, then 
m(f*)(@ +h) = flr(a +h) = fia(a) = a*(f*)(2), 


and so 7 maps L?(G/H) into L?(G)". It is easy to check that 7# is linear. 
Moreover, 7 is onto, since if f € L?(G)", then there is a well-defined map 
f* © L?(G/H) given by f#(2 +H) = f(x), and r*(f*)(x) = fi(a + H) = 
f(a) for all a € G. Finally, 7* is one-to-one since 1#(f*)(x) = 0 for all 
a € G if and only if f#(2 +H) = 0 for alla +H € G/H, that is, if and 
only if f# = 0. This proves that 7* is an isomorphism. 

If x# € G/H, then 


m*(x#) (a + y) 


I 


c+y+H) 
c+ H)x*(y+ H) 
x*)(x)m*(x4)(y), 


I 


and so m ie 
mix!) e Gn L(G)? = G*. 


It is left as an exercise to prove that 7? : G/H > GH isa group isomor- 
phism (Exercise 2). 
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Theorem 4.11 (Poisson summation formula) Let G be a finite abelian 
group and H a subgroup of G. If f € L?(G), then 


1 
[a] 2, f= Gl ae 


xeGH 


Proof. Let f € L(G) and y € G". We define the function f! € 
12(G/H) by 
a+ H)= ber f(a+y). 


yeH 


We define the character y# € G/H by x#(2+ A) = x(x). If r#: G/H — GH 
is the isomorphism constructed in Lemma 4.5, then m#(x#) = x, and the 
Fourier transform of f? is 


flo) = SS) fl@+A)xie+ F) 


e+HEG/H 


= YS YL Aet+ x2) 


t+HEG/H yCH 


= SS Vo fet+yxer+y) 


w+HEG/H yCH 


= SO f@x@) 


LEG 


= fi. 


It follows that the Fourier series for f# is 


fi@+H) = ar > ROM e+ #) 
x!eG/H 
= a S> FOX). 
xeGH 
Equivalently, for x € G, 
mL fet= aq XD Foor) 


yCH 


This is the Poisson summation formula. 
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Exercises 


In these exercises, G is a finite abelian group and H is a subgroup of G. 


1. 


Let G¥ denote the set of all characters y of G such that (h) = 1 for 
all h € H. Prove that G” is a subgroup of G. 


. Let wr! : G/H —+ G# be the map constructed in Lemma 4.5. Prove 


that x? is a group homomorphism. Define \: G! G/H by A(x) (a+ 
H) = x(a). Prove that A is a well-defined group homomorphism, and 
that AT} = nf, 


. Prove that G contains a subgroup isomorphic to G/H. 


Hint: 


G/H=G/H=G# cGe2a. 


. To each character x € G there is a corresponding character x’ € H 


defined by restriction: 
V(h)=x(h) for he H. 
Prove that this defines a homomorphism p : = H with kernel Gi. 
This induces a one-to-one homomorphism of p : é / GH —. H. Prove 
that p is surjective, and so 
CG 


Hint: These two groups have the same cardinality. 


. Let f € L(G), and define f* € L?(G) by 


= So f(w@th). 


heH 


Prove that f# € L?(G)" and 


ft tai det 


. Let G, and G2 be finite abelian groups. Let f € L?(G1 x G2). For 


x, € Gi, define the function f,, € L?(G2) by fx,(v2) = f(r, 22). 
Show that Poisson summation applied to the group G = G x G2 and 
subgroup H = G, x {0} gives 


S- fic, (0) =a ys, De. fal x2): 


©1€Gy 1 EG, x2€G2 
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7. Let f € L?(G x G). Use Poisson summation to prove that 


See) =a De SS) f(x, y)x(a)X(y). 


LEG xeG (x, yJEGxG 


Note that this identity is also an immediate consequence of the or- 
thogonality relations. 


8. This is another example that shows that the lower bound in the uncer- 
tainty principle (Theorem 4.10) is best possible. Let H be a subgroup 
of G, and define 64 € L?(G) by 


_fl ifeeH 
ule)={ if x ¢ H. 


(a) Prove that 
supp(6) = H. 


(b) Prove that if y € G, then 


pe H\ if ce GH 
fi = 4 | ee 


0 ifx¢G, 


(c) Prove that 
supp(6#)supp (61) = |G|. 


4.5 ‘Trace Formulae on Finite Abelian Groups 


We recall some facts from linear algebra. Let A = (a;;) be an n x n matrix. 
The trace of A is the sum of the diagonal elements of A, that is, 


n 
= s aii. 
i=1 


Let B = (b;;) be another n x n matrix. The simplest trace formula (Exer- 
cise 1) states that 
tr(AB) = tr(BA). (4.10) 


Every result in this section follows from this fundamental identity. 
Let V be an n-dimensional vector space, and let B = {v1,...,Un} bea 
basis for V. If T: V > V is a linear operator, and 


nm 
) = S AigVi, 
i=l 
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then the n x n matrix A = (a;;) = [T]g is called the matrix of the operator 
T with respect to the basis B. 
Let B’ = {vj,...,v/,} be another basis for V, and let 


Tu)\= de ait (4.11) 


Then A’ = (a;;) = [T]»’ is the matrix of T with respect to the basis B. 
Each vector vj, € B’ is a linear combination of the vectors in the basis B, 


n 


v; = So igi, (4.12) 


i=1 
and each vector v; € B is a linear combination of the vectors in the basis 
B’, 


n 


i=1 
Consider the n x n matrices R = (rj;) and S = (s;;). Then S = Ro 
(Exercise 2). We have 


— S rad auc SikVs 


l= i=l 


w= k=1 €=1 


Comparing this with (4.11), we obtain 


n n 
= s SikQkel ej 


k=1 @=1 


for all i,7 =1,...,n, and so 
A’ = SAR = RAR. 
Identity (4.10) implies that 
tr(A’) = tr(R7' AR) = tr(ARR™) = tr(A). 
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It follows that we can define the trace of a linear operator T on a vector 
space V as the trace of the matrix of T with respect to some basis for V, 
and that this definition does not depend on the choice of basis. 

The vector v’ € V is called an eigenvector for the operator T with eigen- 
value if vu’ £0 and T(v’) = Av’. The operator T is diagonalizable if there 
exists a basis for V consisting of eigenvectors, that is, there exist nonzero 
vectors v},...,u,, € V and numbers \j,..., An such that B’ = {v},...,v},} 
is a basis for V and T(v,) = Axv} for i = 1,...,n. In this case, the matrix 
for T with respect to the basis B’ is the diagonal matrix 


A 60) 060s 0 OO 
O A O --- 0 O 


p=| 0 0 » +: 0 0 


and so 


ai tr(A) = tr(D) i 


We restate this important identity as a theorem. 


Theorem 4.12 (Elementary trace formula) Let T be a linear opera- 
tor on an n-dimensional vector space V, let B be a basis for V, and let 
A = (aj;) be the matrix of T with respect to B. If T is diagonalizable, 
then V has a basis B = {v},...,v/,} of eigenvectors with T(v;) = Ay, for 
i=1,...,n, and the trace of A is equal to the sum of the eigenvalues of T, 


that is, 
n n 
i=1 t=1 


We shall show that both the Fourier inversion theorem and the Poisson 
summation formula are consequences of this elementary trace formula. 

Let G be a finite abelian group of order n, and let L?(G) be the n- 
dimensional vector space of complex-valued functions on G. For every a € G 
there is a linear operator T, on L?(G) defined by T,(f)(x) = f(~—a). The 
operator Ti, is called translation by a. 

Another class of operators on L?(G) are integral operators. A function 
K € L?(G x G) induces a linear operator ®x on the vector space L?(G) 
as follows: For f € L?(G), let 


Sx(f)(x) = : K(«,y)f(u)dy = So K (0,9) fv). 


yeG 


The map ®x is called an integral operator on L?(G) with kernel K(a, y). 
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Let G = {a1,...,2%,}. Associated to the kernel K is a matrix A = (a,j) € 
M,,(C) defined by 
aij = K (aj, 25). (4.14) 
Conversely, to every matrix A = (a;;) € M,(C) there is a function K(x, y) € 
L?(G x G) defined by (4.14), and an associated integral operator ®x. 


Theorem 4.13 Let G = {21,...,%n} be an abelian group of order n. Let 
K € L?(G x G) and let ®x be the associated integral operator on L?(G). 
The matriz of ®x with respect to the orthonormal basis {6,, :4=1,...,n} 
is (K(a;,2;)), and the trace of Px is 


n 


tr(®x) = >> K(ai, 24). (4.15) 


i=1 


Proof. The matrix of the operator ®x is (cj), where ¢;; is defined by 


) = S- Cig Ox; + 
a=1 


Then 


i) (x) = So K(xi,y)6 Ly (y) = K(ai, 23). 
yEG 


Cy = Ox(6 wy 


This completes the proof. 


Theorem 4.14 Let G be a finite abelian group. Let K € L?(G x G) with 
® x the associated integral operator on L?(G). The operator ®j commutes 
with all translations T,, that is, 


Ta® K(f) = ®KTa(f) 


for alla € Gand f € L?(G), if and only if there exists a function h € L?(G) 
such that K(x, y) = h(a—y) for allz,y € G. In this case, Bx is convolution 
by h, that ts, 


Bx (Ala) = he fla) = flew) Fa 


and the trace of ®x is 
tr(®x) = nh(0). 


Proof. Let f,h € L?(G). We define the convolution operator C;, on 
L?(G) by 


Cle) = he f(a) = | hee w)Fddy =O he FW) 


yeEG 
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(See Exercise 10 in Section 4.3.) Define K(x, y) € L?(G x G) by K(2z,y) = 
h(a — y). Then 


bx(Nle) = | Kewstw)dy= f n(e-w)foidv = Cra), 
G G 
and ®x is convolution by h. For a,x € G, we have 


TaCr(f)(z) = Cr(f)(x—-a) 
= So A(@-a-y)fly) 


yeEG 


= ) A(e-y)fy—a) 


yeEG 


= So A@—-y)Ta(f)(y) 


yeEG 


= CnTa(f)(), 


and so T,C, = CyTg, that is, convolution commutes with translations. 
Conversely, let K(x,y) € L?(G x G). For a,x € G and f € L?(G), we 
have 


Ta®x(f)(2) = Ox(f)(w — a) = 9) K(w—a,y) f(y) 


yEG 
and 


®xTa(f)(z) = DS) K(#,y)Ta(f)(y) 


yEG 


= SY) K(x,y)fy-a) 


yEG 


= Sl K(c,a+y)f). 


yEG 


If ®x commutes with translations, then T,®« = ®x«T,, and 


S> K(a-a,y) f(y) = 45 K(v,a+ y) f(y). 


yeG yeG 
Applying this identity to the function 


f(a) =8(0)={ 9 ite zo. 


we obtain K(x — a,0) = K(a,a) for all a,x € G. Define the function 
h € L?(G) by 
h(a) = K(a,0). 
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Then 
K (x,y) = K(x — y,0) = h(x — y) 


for all x,y € G, and the operator ®x is convolution by h(a). Moreover, 
tr(®x) = nh(0) by (4.15). This completes the proof. 


Theorem 4.15 (Trace formula) For h € L?(G), let Ch be the convolu- 
tion operator on L?(G), that is, C,(f) =h* f for f € L?(G). The dual 
group G isa basis of eigenvectors for C),. If x is a character in Cc then x 
has eigenvalue h(x), that is, 


Ch(x) = h(x)x; 


and 


3 
= 
< 

l 

M 
=) 
& 


xXEG 


Proof. This is a straightforward calculation. For x € G, we have 


Ch(x)(t) = h*x(x) =x * h(z) 
= So x(x-y)h(y) 
yEG 


= |S > ARx(y) | x(2) 


yeG 
= h(x)x(2), 


and so y is an eigenvector of the convolution Cy, with eigenvalue h(x). By 
Theorem 4.12, since G is a basis for L?(G), the trace of C;, is the sum of 
the eigenvalues, that is, 


tr(Ch) = 7 AGO. 
xeG 
By Theorem 4.14, we also have 
tr(C,) = nh(0). 


This completes the proof. 


We can immediately deduce the Fourier inversion formula (Theorem 4.8) 
from Theorem 4.15. If f € L?(G), then 


#0) = ~ > Foo. (4.16) 


xXEG 
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This trace formula can also be obtained by computing the Fourier series 
for f at x = 0. On the other hand, if we simply apply (4.16) to the function 
T_a(f) and use Exercise 9 in Section 4.3, then we obtain 


fa) = T-alf)(0) 
= = Srewoo 


xEG 


= *¥ foo) 


xXEG 


This is the Fourier inversion formula. 

Next, we derive the Poisson summation formula (Theorem 4.11) from 
the elementary trace formula. 

Let H be asubgroup of G, and let  : G > G/H be the natural map. For 
x € G, define 2 = r(x) = a+ H € G/H. There is an orthonormal basis 
for the vector space L?(G/H) that consists of the functions 6,:, where 


1 if ct = yf 
ty _ y 
baelvt) = { 0 ifat F yf. 
For f € L?(G), define the function f* € L?(G/H) by 
Wa +H) = S- f(a@t+y). 
yeH 


Let oe be convolution by f# on L?(G/H). The operator Cy: has matrix 
(f#(2* — y*)), with respect to the basis {6,:}. By Theorem 4.14, the trace 
of Cpe is 


tr(C y+) = |G/H|f*(04) = - S> fy 
Hl ee 


By Theorem 4.15, the character group G ) G/E Hf is a basis of eigenvectors for 
the convolution operator O,:. If x4 € G/H and y = m#(x#) € G4, then 


Crt) = ftot)yxt 


with eigenvalue 


fio) 


I 
= 
ae, 

8 
= 
oa 
Faia 

8 
Se 


= SS Yet yxX2) 


cieG/H yeH 


= So SYS flet+yx(et+y) 


zteG/H yeH 


= SO f@x@) 


“LEG 
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It follows that 


and so 


This is the Poisson summation formula. 


Exercises 


In these exercises, G is a finite abelian group of order n. 


1. Let A = (a;;) and B = (b;;) be n x n matrices. Prove that tr(AB) = 
tr(BA). 


2. Define the matrices R and S by (4.12) and (4.13). Prove that S = 
RSS 


3. Let G = {x1,...,2,}. To every matrix A = (aij) € M,(C) we as- 
sociate a function K4 € L?(G x G) by K4(aj;,7;) = a;;. Prove that 
the map A + Ky is a vector space isomorphism of M,,(C) onto 
L?(G x G). 


4. For a € G and h € L?(G), we have operators T,, and C;, on L?(G), 
where JT, is translation by a and C%, is convolution by h. Prove that 


Ch(6a) = Ta(h). 


4.6 Gauss Sums and Quadratic Reciprocity 


Let m be a positive integer, and Z/mZ the ring of congruence classes 
modulo m. An additive character modulo m is a character of the additive 
group Z/mZ. Since this group is cyclic, the additive characters are the 
functions w_ defined by 


Walk +mZ) = Goria Te ee €m(ak) 


fora =0,1,...,m—1, and the map from Z/mZ to Z/mZ that sends the 
congruence class a+ mZ to the character , is an isomorphism of additive 
groups. 

A multiplicative character modulo m is a character of the multiplicative 
group of units (Z/mZ)*. The principal character yo is defined by yo(a + 
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mZ) = 1 if (a,m) = 1. If x is a multiplicative character of Z/mZ, then we 
extend x to a function on Z/mZ by defining y(a + mZ) = 0 if (a,m) £1. 
Then y € L?(Z/mZ). The Fourier transform of x is XY € L? (Z/mz), 


where 


La) = DY x(k + mZ)alk + mZ) 
k+mZeEZ/mZ 
m—1 
= S- x(k + MZ)em(—ak). 
Gamat 
For every integer a and multiplicative character , we define the Gauss sum 


T(x, @) as the Fourier transform of x evaluated at the additive character 
w_a, that is, 


ra) = RMbal= YD xk+mZem(ak) (4.17) 
Geet 
— Sh Bean): (4.18) 
k=0 


In this section we study multiplicative characters and Gauss sums only for 
odd prime moduli p. 


Theorem 4.16 Let x be a nonprincipal multiplicative character modulo 
the odd prime p. Then 


T(x, @) = X(a+ pZ)r(x, 1). 
Proof. If p divides a, then e,(ak) = 1 for all k, and 


t(x,a) = > x(k + pZ)ep(ak) = > x(k + pZ) =0 
k=1 k=1 


by the orthogonality relations (Theorem 4.6). 
If p does not divide a, then |y(a+pZ)| = 1, the set {ak: k =1,...,p—1} 
is a reduced set of residues modulo p, and 


t(x,a) = >_x(k+pZ)ep(ak) 
k=1 
= 5 x(a + pZ)x(a-+ pZ)x(k + pZ)ep(ak) 
k=1 


p-1 


= X(a+pZ) S— x(ak + pZ)ep(ak) 
k=1 
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= YatpZ) ¥ x(k + pB)ep(k) 
k=1 
= X(a+ pZ)r(x,1). 


This completes the proof. 


Let p be an odd prime number, and let (:) be the Legendre symbol 


modulo p. We define the function ¢, € L*(Z/pZ) by 


m 1 if ais a quadratic residue modulo p, 
£,(a+ pZ) = (2) =<« —1 if ais a quadratic nonresidue modulo p, 
P 0 if p divides a. 


Then @, is a real-valued multiplicative character of Z/pZ, and 


By Theorem 4.16, 
a 
r(bp,a) = (2) r(p). (4.19) 


For example, 


7(3) = T(€3,1) = (5) e3(1) + @ €3(2) 


and 


2 
T(€3,2) = @ 7(3) = -iV3. 
Theorem 4.17 If p is an odd prime and (a,p) = 1, then 
p-l 
7&0) = a €p(ax”). 
x2=0 


In particular, 


p-1 
Oe 
z=0 
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Proof. The set R = {k € {1,...,p— 1}: &,(k + pZ) = 1} is a set of 
representatives of the congruence classes of quadratic residues modulo p, 
and N = {ke {1,...,p—1} : 6,(k+pZ) = —1} is a set of representatives of 
the congruence classes of quadratic nonresidues modulo p. We have |R| = 
|N| = (p—1)/2. Ifa? =k (mod p), then also (p—2x)? =k (mod p). Let 
x#0 (mod p). Since pis odd,z24#p—2 (mod p), and 


It follows that 


4 
Fe. 5 
as 
3 
g 
~S 
| 
3 
git 
ies 
Zao 
= 
SF 
O 
3 
alts 
g 
Ens 
Seas 


kai \P 
= So e(ak) — So ep(ak) 
keR keN 
ae ss €p(ak) — S- €p(ak) 
keR ke RUN 
p-l 
= 142 S- €p(ak) — €,(ak) 
keR k=0 
p-l 
= 1+9 e,(azx’) 
a=1 
p-1 
= S- e,(ax”) 
x2=0 


This completes the proof. 


Theorem 4.18 If p is prime and (a,p) = 1, then 


r(lpva)? = (—*) p= (1p. 


P 


Proof. If p does not divide a, then 


Hee =e (=) eae Ss (2) eptau) 
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i 
i 
oS 
8 
< 
Ny 
a) 
aS) 
ar 
Q 
ed 
8 
+ 
Ss 
II 
3 
| 
8 
Ye 
8 
= 
ee 
is 
S 
on 
frm) 
nt 
8 
+ 
8 
= 


Since 


pol i 
alae +y))= { =a ify#p—1 (mod p), 


it follows that 


repay? = TS (7) esate +) 


by Theorem 3.14. 


Theorem 4.19 Let p and q be distinct odd prime numbers. If (a,p) = 1, 
then 


r(lp,a)@? = (-1) °F (2) (mod 4). 
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Proof. By Theorem 4.18 and Theorem 3.12, 


This completes the proof. 


Recall that if G is a finite abelian group, then the map A: G > G 
defined by 


is an isomorphism. 
Theorem 4.20 If p and q are distinct odd primes, then 
nq _ Pew go! 
(6")(aC-a+ v2)) = prio (2), 
Proof. The function on the left side of the equation is a bit complicated. 


Let G = Z/pZ. Since £, € L*(G), it follows that the Fourier transform 
tp € L(G), and also its qth power é," e IL? (é). The Fourier transform 


o~ 


of this function is ae e L? (@). and so its domain is G = {A(a+ pZ) : 
a+ pZ € G}. We have 


oe 


(6,")(A(-a + p2)) 


a (We) A(—q + pZ) (be) 


- eee 
z=0 


= oe T (Lp, —£) "ba (—q + pZ) 
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I 
——N 
3 | 
NNW 
4 
eae 
ws 
OQ 
8 Ss 
iM 
a 
S18 
Se 
ao 
no} 
Fire 
8 
Sot 


by Theorem 4.18. This completes the proof. 


Theorem 4.21 [fp and q are distinct odd primes, then 


(6°) (a(-a+ 2) =p S- (a), 


ay+...¢%qg=q (mod p) D 
1l<aj<p-1 
Proof. Let k be a positive integer. By Exercise 10 in Section 4.3, a 
product of Fourier transforms is the Fourier transform of the convolution, 
and so 


tp = ly * «l= lp x * by 
a ee 
k times k times 


By (4.8) of Theorem 4.8, for every integer a we have 


Fe eee 
SS 


(G)(A(-a+ pZ)) = bye # 6p(A(-a + p2)) 
— 
k times 


= ply*---*b,(a+ pZ). 
—_-___— 
k; times 
By Exercise 12 in Section 4.3, 
——" eine p 
k; times 1<2jSp-1 


If k =a=dq, then 
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This completes the proof. 


We can now give a second proof of the quadratic reciprocity law. Let p 
and q be distinct odd primes. By Theorem 4.20 and Theorem 4.21, 


rom ()=»  (AS%) 


ayt+...¢%g=q (mod p) 
l<ai<p-1 


By Exercise 14 in Section 3.4, 


and so 


and so 
It follows that 


This is the quadratic reciprocity law. 


Exercises 
1. Show that 
(5) =2 (cos = + cos ~ 
ole => ae COs 5 
2. Show that 


2 4 
1(7) = 12 (sin = + sin = sin). 


3. Let p be an odd prime and yo the principal character modulo p. Prove 
that if p divides a, then T(a, xo) = p—1. 


4. 


10. 


11. 
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Let g be a primitive root modulo the prime p. Prove that, for every 
integer b, the function x», defined by 


x0(g? + pZ) = e?69/(—-1) = e,_1(b3) (4.20) 


is a multiplicative character modulo p. 


Hint: Every congruence class in (Z/pZ)* is uniquely of the form 
g) +pZ for j = 0,1,...,p—2, and the map from (Z/pZ)* to Z/(p—1)Z 
defined by g? + pZ + j + (p—1)Z is an isomorphism. 


. Prove that the dual group of (Z/pZ)”* is the set of functions x» defined 


by (4.20) for b=0,1,...,p—2. 


. Prove that 


for b=0,1,...,p—2. 


. Prove that 


xe(—1 + pZ) = (-1)? 


for b=0,1,...,p—2. 


. Let p be an odd prime number, and g a primitive root modulo p. 


Define the multiplicative characters x» by (4.20). Prove that 


fp = X(p-1)/2- 


. Let x be a multiplicative character modulo m, and let a and b be 


integers relatively prime to m. Prove that 
x(4)X(Wa) = x(8)X(vo). 


Let xy be a multiplicative character modulo m. Prove that 


Let w be an additive character modulo m and xy a multiplicative 
character modulo m. Prove that 


RW) = d(x). 
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4.7 The Sign of the Gauss Sum 


For the odd prime number p, we consider the Gauss sum 


p-1 p-l 
Tp) = F(4,1) =~ € Je (k) = were, 
x=0 


k=1 
By Theorem 4.18, 


2 p  ifp=1 (mod 4), 
—p ifp=3 (mod 4), 


and so 
t\/p ifp=1 (mod 4), 
tiv/p ifp=3 (mod 4). 


In this section we determine the sign of 7(p). We shall prove that 


T(p) ={ ue tp=3 (mod 4). 


Recall that for the cyclic group G = Z/nZ of order n, the character 
group G consists of all functions of the form 


Wala + nZ) = en(ax). 


4 
—m 
s 
Swe 

II 
——s 
Liou 


Moreover, the map from G to G defined by a+nZ + w, is a group 
isoricrplicns If \ € L2(G), then there is a function \# € L?(G) defined by 


Mla + nZ) = A(Wa)- 


The map A +> A! is a vector space isomorphism from L?(G) onto L(G). 
The Fourier transform is a a vector space isomorphism from L?(G) onto 
ey Define F : L?(G) — L?(G) as the composition of the Fourier trans- 
form with the 4 map. If f € L?(G), then 


Fif\(atnZ) = (f)’ (a+ nZ) 


- S- f(a@+nZ)q(x2 + nZ) 


= Yiflet nd) 


where 
w= e,(1) = erry 


The linear operator F is also called the Fourier transform. 
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Theorem 4.22 For all functions f € L?(Z/nZ), 
F'(f)\(a+nZ) =nf(-—a+nZ). 


Proof. This is similar to the proof of (4.8) in Theorem 4.8. Writing 
F(f) = 9, we have 


n—-1 
g(a +nZ) = Se fiytnZ)w-% 
y=0 


and 


F*(f)(a+nZ) F(g)(a+nZ) 


n- 


= S- g(a + nZ)w 
xz=0 


n- 


n—-1 
= SY fly tnd Mw 


x=0 y=0 


n— n—-1 
= > f(y +nZ) S- wy laty) 
y=0 xz=0 


= nf(-—a+nZ). 


This completes the proof. 


The vector space L*(G) has a basis {6;}7. 
is defined by 


-1 
’ 


where the delta function 6; 


_f 1 ife=k (mod n), 
ixle + nd) = { 0 ifa#k (modn). 


We shall compute the matrix of the linear operator F with respect to this 
basis. We have 


n—-1 
F(Se)(G +nZ) = So 6. (a + nZ)w I =wF, 


x=0 


and so : 
FOR) = we 85, 
j=0 


Therefore, the matrix of F with respect to the basis {6,}729 is 


n-1 


ar (4.21) 


M(F) = (w4*) 
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For any positive integer n we define the Gauss sum 


By Theorem 4.17, this is consistent with our previous definition of 7(p) for 


p prime. Since w-* = w* for all integers k, it follows that the trace of the 
matrix M(F) is 


te(M(F)) = So w(-) =o w(R) = FH). 
k=0 k=0 


Since the determinant and trace of a linear operator on a finite-dimensional 
vector space are independent of the choice of basis for the vector space, it 
follows that the trace of the Fourier transform F on the group Z/nZ is the 
complex conjugate of the Gauss sum r(n). 


Theorem 4.23 Let n be an odd positive integer and G = Z/nZ the cyclic 
group of order n. Then the determinant of the Fourier transform F on 
L?(G) is 

_ f (Din? ifn =4k +1, 
det(F) _ { (—1)*in”/? ifn =4k +3. 


Proof. We shall compute the determinant of the matrix M(F) in two 
ways. Let w = e?7/". The square of M(F) is the matrix B = (tse) gees 
where 


n-1 


hy Suan Sue n ifj+k=0 (mod n), 
mee a 0 ifj+k#0 (mod n), 


and so (by Exercise 4) 
det(M(F))? = det(B) = (-1)®-D/2n” = in”. 


Then 


det(M(F)) = 46° D/2n”/?, (4.22) 


The determinant of M(F) is also a Vandermonde determinant (Nathanson [103, 
pp. 78-81]), whose value is 


det(F) = II @* —w4) 
0<j<k<n-1 

io IT] wow (92 2 wit-/2) 
0<j<k<n-1 


= LE, seer (-2 sin (<2) 


0<j<k<n-1 
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= II w G+k)/2 II (~2 sin 
0<j<k<n-1 


0<j<k<n-1 
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oo) 


= Ww Dosicren—1 9+8)/2(_jyn(n—1)/2 II 2 sin (S ~~ in) : 
n 


0<j<k<n-1 


We can compute the exponent of w as follows: 


; n—-1k-1 

jtk 1 
de = ALG TH) 

0<j<k<n-1 k=1 j=0 


- 72 GF) 


eee ae 
9 >) 
by Exercise 6. Since n is odd, it follows that 


j+k 
y i =0 (mod n), 
0<j<k<n-1 


ll 
3 
AN > 


and so ; 
jr Cupapeeeatn eh) 24. 


HO<j<k<n-1, then 0 < 43#) <q andsin (ED) > 0. Therefore, 


det(M(F)) =(-i"""V? JT asin (S 


0<j<k<n-1 


where 


I] 2sin (<2) 0: 


0<j<k<n-1 
Comparing (4.22) and (4.23), we obtain 


det(F) = (-4)"@-VPnr/?. 


By Exercise 7, 


n 


sat) (4.23) 


+ 1, 


_pyn(n—1)/2 _ (-1)* ifn=4k-4 
) =) (1% ifn=4k4 


This completes the proof. 


+ 3. 
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Theorem 4.24 Let p be an odd prime and G = Z/pZ the cyclic group of 
order p. Then the determinant of the Fourier transform F on L?(G) is 


p-2 


det(F) =p [] rw, 0), 


b=1 


where x» is the multiplicative character modulo p defined by (4.20) for 
b=0,1,...,p—2. 


Proof. The p— 1 functions x0, V1,---,Xp—2 are orthogonal in L*(G), 
since 


p-1 : 
es = _ jf p-l1 ifa=od, 
(Xa, Xo) = Yo xale + PEyRate + 72) = { A ifaZb 


by Theorem 4.7. Let 69 be the delta function at 0, that is, 


_f 1 ife=0 (mod p), 
fale +12)={ if~« #0 (mod p). 


Then 
(60, 60) =1 
and 
p—-l1 
(x0, 60) = S> xu(a + pZ)bo(a + pZ) = x0(pZ) = 0. 
x=0 


It follows that the set {60, x0, X1,---;Xp—2} is an orthogonal set of p func- 
tions in L?(G), and so is a basis for L?(G). This basis is called the basis 
of multiplicative characters for L?(G). We shall compute the matrix of the 
Fourier transform F with respect to this basis. 

For every congruence class a + pZ € G we have 


F(60)(a+pZ) = (va) 


= So b0(x + pZ)ba(a + pZ) 


xz=0 
= (pZ) 
1 
= 6o(a+pZ)+xo(a+ pZ), 


where yo is the principal multiplicative character modulo p. Therefore, 


F (60) = 69 + Xo. 


Similarly, 


F(xo)(at+pZ) = 


and so 


F(xo) = 
By Theorem 4.16, and by Exercises 6 and 7 in 


(mod p — 1), then 


F(xo)(a+pZ) = 


I 


and so 
FAX) a 


(—1)°r 
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X0(Wa) 
Y= xo(a 
x=0 

d, Bala + pZ) 


St 
ifa=0 


pa 
fs —1 ifa #0 
(p — 1)60(a + pZ) 


+ pZ)ba(a + pZ) 


(a + pZ) — 


(mod p) 
(mod p) 


xo(a + pZ), 


(p _ 1)é0 — X0- 


Xo(Ha) 

T(Xb, —@) 

T (xb, 1)X6(—a@ + pZ) 
T(xb,1)Xp-1-0(—a + pZ) 
(—1)°r(xo, 1)Xp—1-0(a + pZ), 


OG ii 
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Section 4.6, if b # 0 


(4.24) 


This determines the matrix of F with respect to the basis of multiplicative 
characters. For example, if p = 5, this matrix is 


1 4 0 0 0 
1 -l 0 0 0 
0 0 0 0 —1T(vx3,1) 
0 0 0 T(x2, 1) 0 
0 O -=T(x1,1) 0 0 


By Exercise 4, the determinant of this matrix is 


det(F) = 


p-2 
—p(—1)°-9)/? T] (-1)’r (x0, 1) 
b=1 

—2 p—2 
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This completes the proof. 


We can now determine the sign of the classical Gaussian sum. 
Theorem 4.25 If p is an odd prime, then 
= VB ifp=1 (mod 4) 
= Qnix? /p _ p y p= mo , 
T(p) = e Fo . 
(») 2 ee ifp=3 (mod 4). 
Proof. By (4.24), we have 
F(xo) = (—1)’r(xo, 1)Xp—1-5 
and so 
F'(xe) = F ((-1)’r(xo, 1) xp-1-5) 
(1) 7 DF Ogi) 
(- 1 


1) 
1)’r (xp, 1)(—1)?-*?r (xp_1-85 1) x0 
= (x0; 1)T(Xp-1-0, 1)Xo- 


I 


On the other hand, applying Fourier inversion (Theorem 4.22), we obtain 
F?(xv)(at+pZ) = pxo(—a+ pZ) 

xu(—1 + pZ)pxe(a+ pZ) 

(—1)"px0(a + pZ), 


I 


and so 
F? (xe) = (-1)’pxo- 
It follows that 
7(xbs1)7(Xp—1—b 1) = (-1)"p. 
Let r = (p— 1)/2. It follows from Exercise 8 in Section 4.6 that £, = yr 
and T(p) = T(xr, 1). By Theorem 4.24, 


p—2 


det(F) = p][rhw,1) 
b=1 


I 


pt(p) I T(Xb»1)7(Xp-1-0, 1) 
b=1 


r-1 


= pr(p) |J ((-1)’2) 


b=1 
= (—1)"°-D/2p(@-Y/27(p), 
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By Theorem 4.23, 


_ f (1p? ifp=4k +1, 
det(F) = { (—1)*ip?/? if p= 4k +3. 


If p= 4k +1, then r = 2k and 


(—1)°-D/2p(-1)/27(p) = (—1)8Ak-D p(P-)/2,(p) 
= (Say pe lrg) 
(—1)*pP/?, 
and so 
T(p) = VP. 


If p= 4k +3, then r= 2k +1 and 


(=P Bip / erp) 


I 


(=1)8OF pO Y/27(p) 


(—1)*p®-Y/?7(p) 
(—1)ip?/?, 


and so 


This completes the proof. 


Exercises 


1. Prove that 


>) _ 
2 (cos + cos =) = V5. 


and 


2 4 
2 (sin = + sin = sin) = v7 
Hint: Consider the Gauss sums 7(5) and r(7). 


2. Prove that 


JP ifp=1 (mod 4) and (4) =1, 

— ifp=1 (mod 4) and (2) =-1, 
MG ee 

i/p ifp=8 (mod 4) and (5) =1, 

—i/p ifp=3 (mod 4) and (7) =—1. 
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3. Let w = e?**/3, Compute the trace and determinant of the matrix 


1 1 1 
M= 1 we w 
1 ow wu? 


4. Let A= gy ped be an n— 1x n—1 matrix such that a;,, = 0 if 
j+k#0 (mod n). For example, ifn = 4, then 


0 0 Q1,3 
0 a2,2 0 
Q3,1 0 0 


Prove that 


dee (-1)0-D? TT) ajn—j if n is odd, 
(—1)"-2)/2 ese. Qjn—j if n is even. 
Let B= rebar be an n X n matrix such that b;;, =n ifj+k=0 


(mod n) and bj, =0if 7 +k #0 (mod n). For example, if n = 4, 
then 


ooo Fk 
LroOoCm oO 
ork OO 
oor CO 


Prove that 
_ ff (-1YO YP x | if n is odd, 
det(B) = { (—1)(-2)/2n" if n is even. 


5. Let I, denote the n x n identity matrix. Prove that M(F)* = n7I, 
and so 
det(F)* = n?”. 
6. Prove that for every positive integer n, 


n—1 


(3k? — k) =n(n—1)?. 


7. Let n be an odd integer. Prove that 


k . 
_yn(n—1)/2 _ (-1) ifn=4k+1, 
oa ={ (-1)*i ifn =4k+3. 


8. Prove that the Legendre symbol is an eigenvector of the Fourier trans- 
form with eigenvalue (—1)-)/?7(p). 


Hint: Exercise 8 in Section 4.6. 
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4.8 Notes 


A comprehensive survey of analysis and trace formulae on finite abelian 
and nonabelian groups is Terras, Fourier Analysis on Finite Groups and 
Applications [141]. Our proof of the sign of the Gauss sum uses an argument 
of Schur [126] that appears Landau [87, pp. 207-212] and Auslander and 
Tolimieri [7]. See Berndt and Evans [8] for a review of Gauss sums, and 
Berndt, Evans, and Williams, Gauss and Jacobi Sums [9] for an exhaustive 
monograph. 

For much more sophisticated studies of harmonic analysis in algebraic 
number theory, see Ramakrishnan and Valenza, Fourier Analysis on Num- 
ber Fields [120], and Weil’s classic Basic Number Theory [154]. 
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The abc Conjecture 


5.1 Ideals and Radicals 


In this chapter a ring is always a commutative ring with identity. An addi- 
tive subgroup I of a ring R is called an ideal if ar € I for every a € I and 
r © R. Both R and {0} are ideals in R. The set of even integers is an ideal 
in the ring Z. Indeed, every additive subgroup of Z is an ideal in Z. The 
set of polynomials with constant term equal to 0 is an ideal in the ring R[t] 
of polynomials with coefficients in the ring R. The intersection of a family 
of ideals is an ideal (Exercise 19 in Section 3.1). 

If A is a nonempty subset of the ring R, then the set of all finite linear 
combinations of the form ajr; + --- + azrg with a; € A and r; € Ris an 
ideal of R, denoted by (A) and called the ideal generated by the set A. 
An ideal generated by one element a € R is called a principal ideal and 
denoted by 

(a) =aR= {ar:re R}. 
A principal ring is a ring in which every ideal is principal. For example, 
Z is a principal ring by Theorem 1.3, and Z/mZ is a principal ring by 
Theorem 5.2. 

An ideal J in the ring R is called a prime ideal if I A R and ab € I 
implies a € I or b € I for all a,b € R. The spectrum of the ring R, denoted 
by Spec(R), is the set of all prime ideals of R. 


Theorem 5.1 The spectrum of the ring of integers is 


Spec(Z) = {pZ : p is prime or p = O}. 
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Proof. Since Z is principal, every ideal is of the form dZ for some non- 
negative integer d. If d = 0, then dZ = {0}, and the zero ideal is prime, 
since ab = 0 if and only if a= 0 or b=0. Let d> 1. If d= p is prime and 
ab € pZ, then p divides ab. By Euclid’s lemma, p divides a or p divides 8, 
and so a € pZ or b € pZ. Therefore, pZ is a prime ideal for every prime 
number p. 

If d is composite, then we can write d = ab, where 1 <a<b<d.lf 
a € dZ, then a = dk = abk for some positive integer k, and so 1 = bk, which 
is absurd. Therefore, a ¢ dZ and, similarly, b ¢ dZ. Since d = ab € dZ, it 
follows that dZ is not a prime ideal. Thus, the prime ideals in the ring Z 
are the ideals of the form pZ, where p is a prime number or p = 0. 


An element x in a ring R is called nilpotent if there exists a positive inte- 
ger k such that x* = 0. For example, the additive identity 0 is a nilpotent 
element of every ring, and the multiplicative identity 1 is never nilpotent. 
The congruence class 6+ 27Z is a nilpotent element in the ring Z/27Z. The 
set of all nilpotent elements in R is called the radical of the ring R, and 
denoted by N(R). Thus, the radical of the ring Z is {0}. By Exercise 6, 
the radical of a ring is a proper ideal in the ring. By Exercise 9, the radical 
of a ring is the intersection of the prime ideals in the ring. 

We shall compute the radical of the ring of congruence classes Z/mZ. 
Recall that the radical of the nonzero integer m is the product of the distinct 
prime numbers that divide m, that is, 


rad(m) = I[». 


p|m 
For example, rad(72) = 6, rad(30) = 30, and rad(—1) = 1. 


Theorem 5.2 For m > 2, let Z/mZ be the ring of congruence classes 
modulo m. Then 


(i) Z/mZ is principal, and the ideals of Z/mZ are the ideals generated 
by the congruence classes d+ mZ, where d is a divisor of m; 


(ii) the prime ideals of Z/mZ are the ideals generated by the congruence 
classes p+ mZ, where p is a prime divisor of m; and 


(itt) the radical of Z/mZ is the ideal generated by the congruence class 
rad(m) + mZ. 


Proof. Let J be an ideal in the ring R = Z/mZ. Consider the union of 
congruence classes 
T= U (a+ mZ). 
a+mZe J 
The set J is an ideal in Z. Since Z is principal, J = dZ for some positive 
integer d € I. Since m € mZ C TI, it follows that d is a divisor of m. 


5.1 Ideals and Radicals 173 


Moreover, d+ mZ € J, and so the principal ideal generated by d+ mZ in 
Z/mZ is contained in J. Ifa+mZ € J, then a € a+mZ C I, and so a = dr 
for some integer r. It follows that a + mZ = (d+ mZ)(r + mZ) belongs 
to the principal ideal generated by d+ mZ. Therefore, J is the principal 
ideal generated by d+ mZ, and a+mZ € J if and only if d divides a. (See 
Exercise 3 for a different proof.) 

Next we compute the spectrum of the ring Z/mZ. Let J be the principal 
ideal generated by d+ mZ, where d divides m and d > 2. If d= p is prime 
and 


(a+mZ)(b+mZ) =ab+mZe J, 


then p divides ab and so p divides a or p divides b, that is, a+ mZ € J or 
b+mZ € J, and J is a prime ideal. 

If d = abis composite, where 1 <a <b <d,thena+mZ ¢ J,b+mZ ¢€ J, 
but (a+ mZ)(b+ mZ) =d+mZ € J, and so J is not a prime ideal. Thus, 
the prime ideals of the ring Z/mZ are the ideals of the form p+mZ, where 
p is a prime divisor of m. 

Finally, the congruence class a + mZ is nilpotent in R if and only if 
(a+ mZ)* = a*+mZ = mZ for some positive integer k. Equivalently, 
a+m2Z is nilpotent if and only if m divides a* for some positive integer k. 
By Theorem 1.13, this is possible if and only if a is divisible by rad(m), and 
so N(Z/mZ) is the ideal generated by the congruence class rad(m) + mZ. 


Theorem 5.3 The ring C[t] of polynomials with coefficients in the field C 
of complex numbers is a principal ring. 


Proof. This is a special case of Exercise 18 in Section 3.1. 


Let f(t) € C[t] be a polynomial of degree n. If a1,...,a, are the distinct 
zeros of f(t), then we can factor f(t) into a product of linear terms of the 
form f(t) = cn [];_,(t — ai)’”’, where the leading coefficient c,, # 0 and 
my, +---+m, =n. The radical of the polynomial f(x) is defined by 

rad(f) = [[(¢— ai). 
i=1 


The zero set of the polynomial f(t) is the finite set 


Z(f) = {a € C: fla) =0} = {a1,..., 07}. 
Let No(f) denote the number of distinct zeros of f, that is, No(f) = 
|Z(f)| = r. The degree of the radical of f(t) is the number of distinct 
zeros of f(t), that is, 

deg rad(f) = No(f). 
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Theorem 5.4 Let f(t) € C[t] and R = C[t]/I, where I = (f(t)) is the 
principal ideal generated by f(t). The radical of R is the principal ideal 
generated by rad(f) + I. 


Proof. This follows immediately from the observation that if f(t) and 
g(t) are polynomials with complex coefficients, then there exists a positive 


integer k such that f(t) divides g(t)* if and only if rad(f) divides g(t). 


Exercises 


1. 


2. 


Determine rad(3”) and rad(n!) for all n > 0. 


Let mand n be nonzero integers. Prove that rad(mn) < rad(m)rad(n). 
Prove that rad(mn) = rad(m)rad(n) if and only if (m,n) = 1. 


. Let f: R— S bea surjective ring homomorphism. Prove that if the 


ring R is principal, then the ring S is also principal. Apply this to 
the map f : Z— Z/mZ defined by f(a) = a+ mZ. 


. Prove that a unit in a ring RF {0} is never nilpotent. 


. Let R be an integral domain, that is, a ring with the property that 


if 71,22 € Rand x1x2 = O, then x, = 0 or x2 = O. Prove that if 
@,...,%, € Rand x,---x, = 0, then x; = 0 for some 7. Prove that 
0 is the only nilpotent element in an integral domain. 


. Let R be a ring and let N(R) denote the set of all nilpotent elements 


in R. Prove that (R) is an ideal. 


Hint: Prove that if x is nilpotent, then xr is nilpotent for every r € 
R. Use the binomial theorem to show that if x* = y’ = 0, then 
(a+ y)FtR1 = 0. 


. Prove that if x is nilpotent, then x is contained in every prime ideal 


of R, and so 
NAG SP) ob 


TeSpec(R) 


. Prove that if x is not nilpotent, then there exists a prime ideal of R 


that does not contain x. 


Hint: Let S = {x* : k = 1,2,...}. Let Z be the set of all ideals 
in R that do not contain any element of S. If x is not nilpotent, 
then 0 ¢ S and {0} € Z. Use Zorn’s lemma to prove that the set Z 
contains a maximal element J, and that J is a prime ideal in R. such 
that INS = 90. 
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9. Prove that the radical of the ring FR is the intersection of all prime 
ideals of R, that is, 


N(R= [) FL 
TeSpec(R) 
10. Let a1,...,a@,% be divisors of m, and let [ai,...,az] be their least 


common multiple. Let (a;+mZ) denote the principal ideal generated 
by the congruence class a; + mZ in the ring R = Z/mZ. Prove that 


k 
( (ai + mZ) = ([ar,...,a%] +:mZ). 


i=l 


Hint: Observe that (a; + mZ) = a;Z and apply Exercise 30 in Sec- 
tion 1.4. 


11. Use Exercises 9 and 10 to prove that 


N(Z/mZ) = (rad(m) + mZ). 


12. Let J and J be ideals in a ring R. The product IJ is the ideal of R 
generated by the set of all elements of the form xy with x € J and 
y € J. In the ring Z, prove that the product of the principal ideals 
aZ and bZ is the ideal abZ. 


13. Let J and J be ideals in the ring R. We say that I divides J if I 
contains J, that is, J C I. Prove that if P is a prime ideal in R and 
if P divides the product ideal IJ, then P divides I or P divides J. 


14. Let J and J be ideals in Z. Prove that if I divides J, then there exists 
an ideal K in Z such that [kK = J. Prove that every ideal in Z is 
uniquely a product of prime ideals. 


5.2 Derivations 


A derivation on a ring Risa map D: R — R such that 
D(a + y) = D(x) + D(y) (5.1) 


and 
D(xy) = D(x)y + «D(y) (5.2) 


for all x,y € R. Condition (5.1) says that D is a homomorphism of the 
additive group structure of R. Condition (5.2) implies (Exercise 1) that 
D(1) = 0 and that, if x € R is invertible, then 


DG ae), 


ax 
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Moreover, it follows by induction (Exercise 2) that 
Ya -0j-1D(2j)ti41 +++ Sn 


for all 7,...,%, € R. 
The next result shows that the derivative is a derivation on a polynomial 
ring. 


Theorem 5.5 Let R be a ring and R{t] the ring of polynomials with coef- 
ficients in R. Define D: Rit] — R{t] by 


D bs ot) —_ S- ia;t’+. 
i=0 i=1 
Then D is a derivation on Rit]. 


Proof. Let f = f(t) = 772o ait* and g = g(t) = 7h _o byt?. It is imme- 
diate that D(f + g) = D(f) + D(g), and so D is a homomorphism of the 
additive group of polynomials. Since 


m+n 
= atts = 52D ath 
1=0 7=0 k=0 i+j=k 
we have 
m+n 
Difg) = Sok S> ajbjth 
k=1 i+j=k 
m+n 
= Sy rience 
k=1 i+j=k 
mr m+n 
= 3 S- ia;t* ‘bt +S > S- a;t' jb;t?—' 
k=1 it+j=k k=1 i+j=k 
m n m n 
= Soo iasti bjt + 57S ait! jbjt? + 
i=1 j=0 i=0 j=1 
= D(f)g+ fD(g). 


Therefore, D is a derivation on R[t]. 


An integral domain is a ring R such that if b1,b2 € R with b; # 0 and 
bo # 0, then bib2 ¥ 0. Corresponding to every integral domain is a field, 
called the quotient field of R. It consists of all fractions of the form a/b, 
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where a,b € R and b 4 0, and a/b; = ag/be if and only if ayb2 = agby. 
Addition and multiplication of fractions are defined in the usual way: If 
a1, 42, 61, be € R with by x 0 and ba x 0, then by be x 0 and 

ay ag a bo + aod; ay a2 a a2 


= d — - — = ‘ 
ae bibs a bs Babs 


The quotient field of Z is Q. If Flt] is the ring of polynomials with coeffi- 
cients in a field F’, then the quotient field of F'[t] is the field F(t) of rational 
functions with coefficients in F’. A careful construction of quotient fields 
can be found in the Exercises. 


Theorem 5.6 Let R be an integral domain with quotient field F’, and let 
D be a derivation on R. There exists a unique derivation Dr on F such 
that Dr(a) = D(a) for alla € R. 


Proof. Suppose that there exists a derivation Dr on F' such that Dr(a) = 
D(a) for alla € R. Let x € F,x #0. There exist a,b € R with b 4 0 and 
x = a/b. Since a = br € R, it follows that 


D(a) = Dr(a) = Dr(bx) = Dr(b)a + DD r(x) = D(b)x + bDr(z), 


and so 


Dr (2) Spee D(a) Pe = oe (5.3) 


Thus, the derivation Dr on F is uniquely determined by the derivation D 
on R. In Exercise 3 we prove that (5.3) defines a derivation on the quotient 
field Rp. 


Let D be a derivation on the field F’. For x € F', we define the logarith- 
mic derivative L(x) by 


If x,y € F*, then 
Dry) — D(z)y+eD(y) _ D(x) . DY) 


Oe nts re a 
and 
e\ D(x) , Dy) D(z) Diy) _ ee 
u(Z)- i ae: po ee 


by Exercise 1. 
We now consider polynomials with complex coefficients. A field F' is 
called algebraically closed if every nonconstant polynomial with coefficients 
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in F' has at least one zero in F. By the fundamental theorem of algebra, 
the field C is algebraically closed. Let f(t) € C[é], and let No(f) denote the 
number of distinct zeros of f(t). If f(t) has degree n with leading coefficient 
Gn, then f(t) factors uniquely in the form 


No(f) 
f(t) =an Tf (t-ai)™, 
i=l 
where a1,...,@N,(f) are the distinct zeros of f, the positive integer n; is 


the multiplicity of the zero aj, and ny +++: + ny vf) = n. If D is the 
derivation on C[¢] defined in Theorem 5.5, then, by Exercise 2, 


No(f) No(f) 
D(f) = an na(t—o4)™—* |] (t- 05)” 
a 2 
ae No(f) 
0 
f t=1 b= 4 


Let g(t) = dn eee — 6;)'" be a nonzero polynomial in C{t], and con- 
sider the rational function f/g € C(t). Then 


f No(f) es No(g) ini 
L{*+{)=L(f)-—L(g)= . a 5.4 
(4) = x11) - 200 Let ep 


This algebraic identity will be used in the next section to prove Mason’s 
theorem. 


Exercises 


1. Let D be a derivation on a ring R. Prove that D(1) = 0 and that, if 
xz € Ris invertible, then 


D(a") =- 
2. Let D be a derivation on the ring R. Prove that 
>a -0;-1D (a4) ti41+** En 


for all 71,...,@% € R. 


3. Let R be an integral domain with quotient field F’. Let D be a deriva- 
tion on R, and define the function Dr on F by (5.3). We shall prove 
that Dr is a derivation on the quotient field F’. 
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(a) Prove that Dr is well defined, that is, if a1/b1 = a2/be, then 
Dr(ay/b1) = Dp(az/be). 
(b) Prove that 


(c) Prove that 


a1 a2 a \ a2 , a a2 
Dr{|—-—)])=D + —D : 
aG = Gar b (5) 
4. Let R be a commutative ring with identity. A multiplicatively closed 
subset of R is a subset S such that 1 € S$ and if s,,s9 € S, then 


8182 © S. We consider the set of ordered pairs of the form (r,s) with 
r€ Rand s€S. Define a relation on this set as follows: 


(r,s) ~ (r’,s’) if s”(s'r — sr’) =0 for some s” € S. 
Prove that this is an equivalence relation. 


5. Let S~'R be the set of equivalence classes of the relation defined in 
Exercise 4. We denote the equivalence class of (r,s) by the fraction 
r/s. We also denote the equivalence class (r,1) by 7. Define multipli- 
cation of fractions as follows: 


Tri, 12 r1T2 


S, S92 8182 : 
(a) Prove that this multiplication is well defined, that is, if (r1, 61) ~ 
(71,81) and (12, $2) ~ (13, 8), then (rirz, $182) ~ (rr, $19). 


(b) Prove that multiplication in $~'R is associative and commuta- 
tive, and that the equivalence class of (1,1) is a multiplicative 
identity. 


(c) Prove that the equivalence class of (s,1) is invertible in S~'R 
for every s € S. 


(d) Prove that 


for alla€ Rands,s’€S. 


6. Define addition of fractions in S~!R as follows: 


T1 a T2 S271 + 8172 


$1 $2 $152 


180 


10. 


11. 
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(a) Prove that this addition is well defined, that is, if (71,51) ~ 
(r}, 84) and (re, $2) ~ (74, 84), then (seri + $112, $182) ~ (sory + 
fal: Piet 
8172, 8189). 
(b) Prove that addition in S~'R is associative and commutative, 
and that multiplication distributes over addition. Prove that the 
equivalence class of (0,1) is an additive identity. 


. (Localization) In Exercises 4-6 we proved that S$~'R is a ring. This 


ring is called the ring of fractions of R by S. We also say that S~'R 
is constructed by localizing R at S. 


(a) Prove that if 0 € S, then S~'R = {0}. 


(b) Prove that if R is an integral domain and 0 ¢ S, then S~!R is 
an integral domain. 


(c) Prove that if R is an integral domain and S is the set of all 
nonzero elements of R, then S~'R is a field. This field is called 
the quotient field of the integral domain R. 


. Define ys: R-> S7!R by gs(r) =r/l=r. 


(a) Prove that yg is a ring homomorphism. 


(b) Prove that if R is an integral domain and 0 ¢ S, then yp is 
one-to-one. 

(c) Prove that if R is an integral domain and S = R*, then S~!R 
is isomorphic to R. 
Hint: If S is a multiplicative subset of R and s € SM R*, then 
(r,s) ~ (s7'r,1) for all r € R. 


. Let S = {1,2,4,8,...} be the multiplicative subset of Z consisting 


of the powers of 2. Describe the ring of fractions S~!Z. What is the 
group of units in this ring? 


Let S = {+1,+3,+5,+7,...} be the multiplicative subset of Z con- 
sisting of the odd integers. 


(a) Describe the ring of fractions S~1Z. 
(b) Describe the principal ideal generated by 2 in this ring. 


(c) Prove that every element of the ring not in this ideal is a unit 
in S~1Z, and so (2) is a maximal ideal in S~!Z. 


Let p be a prime number and let S be the set of all integers not divis- 
ible by p. Prove that S$ is a multiplicative subset of Z, and describe 
the ring of fractions $~!Z. Prove that the principal ideal generated 
by p is a maximal ideal in S~!Z. 
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12. Let F[t] be the polynomial ring with coefficients in the field F’. Let 
S = {1,t,t?,t?,...} be the multiplicative subset of F[¢] consisting 
of the powers of t. Prove that S~'F{t] is isomorphic to the ring of 
Laurent polynomials with coefficients in F, that is, the ring consisting 
of all expressions of the form SScaen at’, where a; € F, and m and n 
are integers with m < n, and addition and multiplication are defined 
in the usual way. 


13. We consider the ring R = Z/12Z, and denote the congruence class 
a+12Z by a 


(a) Prove that S = {1,3,9} is a multiplicative subset of R. 
(b) Let ys : R — S~'R be the ring homomorphism constructed 


in Exercise 8. Prove that ys(@) = ygs(b) if and only if a = b 
(mod 4). 


(c) Prove that 1/3 = 3 in S“!R. 
(d) Prove that S~1R & Z/4Z. 


14. Let m > 2. We consider the ring R = Z/mZ, and denote the congru- 
ence class a+ mZ by a. Let S be a multiplicative subset of R such 
that 0 ZS. 


(a) Prove that we can factor m uniquely in the form m = mgm, 
where (mo,™m1) = 1, and if p is a prime number that divides m, 
then p divides mp if and only if there is a congruence class 5 € S$ 
such that p divides s. Show that (s,m) =1 for alls € S. 


(b) Prove that there is a congruence class 35 € S such that mo 
divides so. 


(c) Let gs : R ~ S7!R be the ring homomorphism constructed 


in Exercise 8. Prove that ys(@) = ygs(b) if and only if a = b 


(mod m1). 

(d) Prove that for every 5 € S there exists 7 € R such that 1/5 =7 
in S“'R. 
Hint: If s € S, then there exists an integer r such that rs = 1 
(mod m1). 


(ec) Prove that S-'R& Z/mZ. 


5.3. Mason’s Theorem 


This is an important diophantine inequality for polynomials. 
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Theorem 5.7 (Mason) I/fa,b,c € C[t] are nonzero, relatively prime poly- 
nomials, not all constant, and if 


a+b=c, 
then 
max{deg(a), deg(b), deg(c)} < No(abc) — 1 = deg(rad(abc)) — 1, 


where No(abc) denotes the number of distinct zeros of the polynomial abc, 
and rad(abc) is the radical of abc. 


Since Mason’s theorem is symmetric in a,b, and c, we could also write 
the equation in the form a+ b+c=0. 

Proof. Let D be the unique derivation defined on the rational function 
field C(t) by Theorems 5.5 and 5.6, and let L be the logarithmic derivative. 
We introduce the nonzero rational functions u = a/c and v = b/c in C(t). 


Then u+v = 1, and 
(2) (2 


D(u) + D(v) = D(ut+ v) = D(A) 
= 0. 


uL(u) + vL(v) 


I 


I 


Since L(v) 4 0 (by Exercise 1), we have 


Pe Pe ee (5.5) 


We write the standard factorizations of the polynomials a, b, and c as fol- 
lows: 


No(a) 


a = a(t)=ay II eee) hace 


Applying (5.4), we obtain 
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and 
b No(b) vr No(c) rE 
Liv) = L = 7 ; 
(v) (2) yy t— B; 2 t— Yk 


Since the polynomials a,b, and c are relatively prime, the radical of the 
product abc is 
No(a) No(b) No(c) 
q = rad(abc) = (t — aj) II ¢ (t — B;) (t— 4); 
i=l i=1 i=1 
and 
deg(q) = deg(rad(abc)) = No(a) + No(b) + No(c). 


Moreover, g£(u) and gL(v) are polynomials of degree at most deg(q) — 1. 
By (5.5), 


and so 

a(qL(u)) = —b(qgL(v)). 
Since the polynomials a and 0 are relatively prime, it follows that a divides 
qL(v), and so 


deg(a) < deg(qZ(v)) < deg(q) — 1 = deg(rad(abe)) — 
Similarly, 
deg(b) < deg(qL(u)) < deg(q) — 1 = deg(rad(abe)) — 


and 
deg(c) < deg(rad(abc)) — 


This completes the proof. 


Fermat’s last theorem states that if n > 3, then the Fermat equation 
gry” = 


has no solutions in positive integers. The Fermat equation has solutions in 
polynomials for n = 2, for example, 


(1—#)? +4)? = +27)?. 


We shall use Mason’s theorem to prove Fermat’s last theorem for polyno- 
mials for n > 3. 


Theorem 5.8 [fn > 3, then the Fermat equation x” + y” = z” has no 
solution in nonzero, relatively prime polynomials, not all constant. 
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Proof. Let n > 3, and suppose that x,y, and z are nonzero, relatively 
prime polynomials, not all constant, such that 2” + y” = z”. We apply 
Mason’s theorem with a = 2",b= y”, and c= z”. Then 


rad(abc) = rad(a"y"z") = rad(ayz). 
Since deg(a”) = ndeg(ax), we obtain 


ndeg(z) < mnmax(deg(x), deg(y), deg(z)) 
= max(deg(x"), deg(y”), deg(z")) 
= ne eee ), deg(c)) 
<  deg(rad(abc)) — 
= es ae -1 
< deg(xyz) — 
= deg(x) + a ) + deg(z) — 1. 

It follows that 


3(deg(a) + deg y + deg(z)) — 3 
n(deg(x) + deg y + deg(z)) — 3. 


n(deg(x) + deg y + deg(z)) 


This is impossible. 


Exercises 
1. Prove that L(v) # 0 in the proof of Theorem 5.7. 


2. Let n > 3. Prove that the equation 7” + y” = 1 has no solution in 
nonconstant rational functions x,y € C(t). 


3. (Nathanson [102]) The Catalan equation is the equation 
vc y” = Ly 


where m and n are integers greater than 1. Prove that this equation 
has no solution in nonconstant polynomials x,y € C[é] and integers 
m>2andn> 2. 


4. (Davenport [20]) Let f and g be nonconstant, relatively prime poly- 
nomials in C[¢]. Prove that 


deg(f® — 9?) > 5 dea(f) +1. 
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5. Let 
f = t+4t*+10t7+6 
g = +647 +2105 + 358 4 St 
Check that 
351 


fi —g? =27t* + Oo + 216. 


This example shows that the lower bound in Davenport’s theorem 
(Exercise 4) is best possible. 


5.4 The abc Conjecture 


The abc conjecture is a simple but powerful assertion about the relationship 
between the additive and multiplicative properties of integers. Recall that 
the radical of a nonzero integer m is the largest square-free divisor of m, 
that is, 

rad(m) = II D. 


p\m 


The abc conjecture states that for every ¢ > 0 there exists a number K(e) 
such that, if a,b, and c are nonzero, relatively prime integers and 


a+b=c, 


then 
max(|a|, |b|, |c]) < K(e)rad(abc)'**. 


Since the inequality is symmetric in a,b, and c, the equation can also be 
written in the form a+ b+c= 0. To prove or disprove this conjecture is 
an important unsolved problem in number theory. 
From the abc conjecture it is possible to deduce many theorems and still 
unproven propositions in number theory. Here are some examples. 
Fermat’s last theorem states that, for n > 3, the Fermat equation 


x+y” = 2” (5.6) 


has no solution in positive integers. Note that if x, y, z is a solution of (5.6) 
in positive integers and if a prime number p divides x and y, then p also 
divides z, and 2/p, y/p, z/p is another solution of the equation. It follows 
that if the Fermat equation has a solution in integers, then it has a solution 
in relatively prime integers. 


Theorem 5.9 (Asymptotic Fermat theorem) The abc conjecture im- 
plies that there exists an integer no such that the Fermat equation has no 
solution in relatively prime integers for any exponent n > no. 
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Proof. Let x,y, and z be relatively prime positive integers such that 


rt y = 2". 


We note that 
rad(a"y"z") = rad(xyz) < xyz < 2°. 
If n > 2, then z > 3. Applying the abc conjecture with « = 1 and ky, = 
max(1, K(1)), we obtain 
2” = max(x2", y™, 2") < Kyrad(x"y"2z")? < K,2°, 
and so 


log K log kK 
bys £64 
log z 


log3 ~ 


This completes the proof. 


The Catalan conjecture asserts that 8 and 9 are the only consecutive 
powers. Equivalently, it states that the only solution of the Catalan equation 
gm” ay" =1 
in integers x,y,m,n all greater than 1 is 

S22 ST 
It is known that the diophantine equation x” — y? = 1 has no solution in 
positive integers, and that the only solution of the equation x? — y” = 1 in 


positive integers is x = n = 3 and y = 2. Therefore, it suffices to consider 
the Catalan equation only for min(m,n) > 3. 


Theorem 5.10 (Asymptotic Catalan theorem) The abc conjecture im- 
plies that the Catalan equation has only finitely many solutions. 


Proof. Let (x, y,m,n) be a solution of the Catalan equation with min(m,n) > 
3. Then x and y are relatively prime. It follows from the abc conjecture with 
é = 1/4 that there exists a constant Ky = K(1/4) such that 


y” <a2™ < Korad(2™y")*/4 = Korad(ay)*/4 < Ko(ay)*’*, 


and so P 
mloga < log Ky + Z (log x + log y) 


and 


5 
nlogy < log Ky + a (log x + log y) . 
It follows that 


5 
mlogaz+nlogy < 2log K2+ 5 (log + log y) , 
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and so 


5 5 
(m =) log x 4 (u =) logy < 2log Ko. (5.7) 
Since x > 2 and y > 2, we have 


2log K: 
mabye oH Beas 
log 2 


Thus, there are only finitely many pairs of exponents (m,n) for which 
the Catalan equation is solvable. For fixed exponents m > 3 and n > 8, 


equation (5.7) has only only finitely many solutions in positive integers x 
and y. This completes the proof. 


For every odd prime p we have 2?-!' = 1 (mod p), that is, p divides 
2?-1 _ 1. The question of the divisibility of 2?~! — 1 by p? arose in the 
study of Fermat’s last theorem. An odd prime p such that 


2?-' £1 (mod p”) 


is called a Wieferich prime. For example, 3, 5, and 7 are Wieferich primes, 
since 2741 (mod 9), 2441 (mod 25), and 2°41 (mod 49). It is 
not known whether infinitely many Wieferich primes exist, nor is is known 
whether there are infinitely many primes that are not Wieferich primes. 

Let W be the set of Wieferich primes. We shall show that the abc con- 
jecture implies that W is infinite. We begin with a simple lemma. 


Lemma 5.1 Let p be an odd prime. If there exists a positive integer n such 
that 2” = (mod p) but 2" 41 (mod p?), then p is a Wieferich prime. 


Proof. Let d be the order of 2 modulo p. Then d divides n. Since 2” # 1 
(mod p?), it follows that 24 # 1 (mod p?). Then 27 = 1+ kp, where 
(k,p) = 1. Moreover, d divides p — 1, since 2?-! = 1 (mod p), and so 
—1= de for some integer e such that 1 < e < p—1. Then (ek, p) = 1 and 


gP-1 — (24) = (14+ kp)°=1+ekp#1 (mod p’), 


and p is a Wieferich prime. 


A powerful number is a positive integer v such that if a prime p divides 
v, then p? divides v. For example, 72 is powerful but 192 is not. If v is 
powerful, then rad(v) < v!/?. 


Theorem 5.11 The abc conjecture implies that there exist infinitely many 
Wieferich primes. 
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Proof. Let W be the set of Wieferich primes. For every positive integer 
n, we write 


2” —1=Unn, 


where v,, is the maximal powerful divisor of 2” —1. Then uy, is a square-free 


integer, 
Un = II P; 
p|n 
vp(n)=1 
and 


Un = II pr (n) ; 
| 


pire 
vp(n)>2 


If p divides un, then 


but 
2” 41 (mod p’). 


It follows from Lemma 5.1 that p € W, and so u, is a square-free integer 
divisible only by Wieferich primes. 

If the set W is finite, then there exist only finitely many square-free 
integers whose prime divisors all belong to W, and so the set {uy :n = 
1,2,3,...} is finite. It follows that the set {v, :n = 1,2,3,...} is infinite, 
and, consequently, unbounded. Since v,, is powerful, we have 


rad(v,) < vl/?. 
Let 0 < ¢ <1. Applying the abc conjecture to the identity 
(2” —1)+1= 2", 
we obtain 
Un 


(QP =i 


Gio) 


Qun)'t*rad(un)* Te 


Vem 


RAN IA IA A 


This implies that the numbers v,, are bounded, which is absurd. This com- 
pletes the proof. 
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Exercises 


1. 


For a fixed exponent n > 4, prove that the Fermat equation z”+y” = 
z” has at most a finite number of solutions in positive integers a, y, z. 
Does this argument show that the cubic Fermat equation 23+y? = z3 
has at most finitely many solutions? 


Hint: Apply the abc conjecture with ¢ = 1/6. 


An integer n is powerful if vp(n) # 1 for all primes p. Compute the 
powerful numbers up to 100. 


Let n > 2 be an integer. Define the power of n by 


logn 
power(n) = losnad(ay. 
Prove that power(n) = 1 if and only if n is square-free. Prove that 
if n is powerful, then power(n) > 2. Prove that if n is a kth power, 
then power(n) > k. 


. (Granville) Prove that the abc conjecture implies that there exist only 


finitely many triples of consecutive powerful numbers. 


Hint: Suppose that n — 1,n,n +1 are three consecutive powerful 
numbers. Apply the abc conjecture to the equation (n? —1)+1=n?. 
Observe that 


rad(n?(n? — 1)) 


l| 


rad((n — 1)n(n + 1)) 
< (n — 1)n(n + 1) < n3/?, 


Let = 
U=|J{e*: ce N} = {u}®, 
k=3 


be the set of nonsquare powers of the positive integers, where u; < 
uj+1 fori =1,2,.... Prove that the abc conjecture implies 


lim (Wind om Ui) =o. 
—0o 


tu 
Prove that the abc conjecture implies that the diophantine equation 
ni+1=m? 


has only finitely many solutions. 


Hint: Apply the inequalities 


[[e<” 


pan 
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(Theorem 8.1) and 


()" <<) 


(Exercise 1 in Section 6.2). 


Slr 


7. Prove that the abc conjecture is false if we omit the condition (a, b,c) = 
ile 


Hint: Consider the equation 3* + 2-3* = 3*+1, 


8. In this exercise we construct an example to show that the abc con- 
jecture would be false if we replaced the exponent 1+ with 1. 


(a) Prove that for every positive integer n there exists a positive 
integer u, such that 


2"u, +1=32"—. 


Hint: Euler’s theorem. 
(b) Let an = 2” un, bn = 1, and cp = 32""" Prove that 


6-32" 


rad(dnbpCn) = rad (6un) < afi 


(c) Let K(0) > 0. Prove that if n is sufficiently large, then 


6K (0)cn 


K(0)rad(anbncen) < on 


< Cyn = mMax(An, bn, Cn). 
Since ay, + by = Cn, this is the desired counterexample. 


9. Let a and b be relatively prime positive integers. We define c= a+b 


and 
log c log(a+ 6) 


L(a,b) = = . 
kab} lograd(abc) log rad(ab(a + b)) 


It is hard to find relatively prime integers a and b for which L(a, b) 
is large. Use the equation 


2+ 319109 = 23° 


to compute L(2,31°109). In October,1999, this was the largest known 
value for L(a, 6). 


10. Compute L(a,b) for a= 1 and b= 2-3". 


11. Compute L(a,b) for a = 11? and b = 3? -5°®- 73. 
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12. For n > 1, define the positive integer t,, by 
9" =1+4+ 8ty. 
Prove that L(1,8t,) > 1 and so 


lim sup L(a, b) > 1. 
(a,b)=1 


It can be shown that the abc conjecture is equivalent to 


lim sup L(a, b) = 1. 
(a,b)=1 


5.5 The Congruence abc Conjecture 


Let m > 2. The congruence abc conjecture for m states that for every e > 0 
there exists a number K(m,¢) such that, if a,b,c are nonzero, relatively 
prime integers with 

abe =0 (mod m) 


and 
a+b=c, 


then 
max(lal, |b], [el) < K(m, e)rad(abe) +, 


This a weaker assertion than the abc conjecture, which is unrestricted by 
any congruence condition. However, we shall prove that if the congruence 
abc conjecture is true for some modulus m, then the unrestricted abc con- 
jecture is also true. 

We begin with some simple observations about triples (a, b, c) of integers 
such that a+ 6 = c. First, at least one of the integers a,b, or c must be 
even, and so abc = 0 (mod 2). Therefore, the congruence abc conjecture 
for m = 2 is the same as the abc conjecture, and we need to consider only 
moduli m > 3. Second, if (a,b,c) = 1, then either c is odd and b — a is 
odd, or c is even, both a and b are odd, and 6 — a is even. Third, if a,b,c 
are distinct nonzero integers, then, by a permutation, we can assume that 
they are positive anda <b<ce. 


Lemma 5.2 Let a,b,c be relatively prime positive integers such that 
a<b<e 


and 
at+b=c. 
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Let n > 2. If c is odd, define 


An (b— a)”, 
B, = c"—(b-a)", 
Gyn = oc™ 


If c is even, define 


a 
ay <8 
| II 
~~ 
NlO o 
ee 
s NI | 
fr) 
ee Cae 
—~ 3 
ee 
| 
a 
Noy 
2 


Then Ay, Bn,Cn are distinct, relatively prime positive integers such that 
An + Bn = Cn. 
Ifm>3 and n= v(m), then 
An BnC, =90 (mod m). 


Proof. It is left to the reader to show that A,,Bn,Cy are distinct, 
relatively prime positive integers such that A, + B, = C, (Exercises 1, 2, 
and 3). 

Let m > 3 and n= y(m). Then n > 2. We must prove that 


Ay, BnC, =90 (mod m). 
It suffices to prove that if p is a prime and p” divides m, then 
A, BnCn =0 (mod p’). (5.8) 


Note that if p is a prime and p” divides m, then (p—1)p"~1 divides n, and 
so 
ro! € (p—1)p" 1 <n. 
Suppose that p is an odd prime. If p divides c, then p” divides c” and p” 
divides C;,. Since r < n, it follows that C;, = 0 (mod p’). Similarly, if p 
divides b—a, then A, =0 (mod p’). If p divides neither c nor b—a, then, 
by Theorem 2.12, 
cH)" = 1 (mod p”) 
and Xs 
(b— a)?" =1 (mod p’). 


Since (p — 1)p"~+ divides n, we have 


c’ =(b—-a)"=1 (mod p’), 
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and so B, =0 (mod p"). This proves (5.8) for odd primes p. 

Finally, we consider the prime 2. If 2” divides m, then 2"~1! divides n and 
r <n. If cis even, then b—a is even and exactly one of the integers c and 
b — a is divisible by 4 (Exercise 4). It follows that either c” or (b— a)” is 
divisible by 4”, and so either C;,, or Ay is divisible by 2”, which is divisible 
by 2”. 

If c is odd, then b — a is odd and 


a (b- or =1 (mod 2"). 
Since 2"—! divides n, we have 


By, =c"—(b-—a)"=0 (mod 2”). 


This proves (5.8) for the prime 2. 


Theorem 5.12 Let m > 3. If the congruence abc conjecture is true for m, 
then the abc conjecture is true. 


Proof. Let 0 < ¢ < 1. For triples a,b,c of distinct, relatively prime 
positive integers such that a+ b= c, we define the function 


®.(a,b,c) = loge — (1 + €) log rad(abc). 
Then 
eloge ®,(a,b,c) 
l+e l+te © 
Let A, B,C be distinct, relatively prime positive integers such that ABC = 


0 (mod m) and A+ B = C. If the congruence abc conjecture is true for 
m, then there exists a constant K(m,¢) > 0 such that 


log rad(a, b,c) = loge 


C < K(m,e)rad(ABC)'T*, 
or, equivalently, 
®.(A, B,C) < log K(m,<) = K*(m,¢). 

Let a,b,c be relatively prime positive integers such that a < b < c and 

a+b=c. Let 
n= y(m). 

Then n is even, by Exercise 4 in Section 2.3. Define the integers Ay, Bn, Cn 
as in Lemma 5.2. Then A,B,C, = 0 (mod m) and A, + By = Ch. 


Moreover, 
®-(An, Bn, Cn) < K*(m,e). 
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The integer n is even, since m > 3, and so, by Exercise 5, 


By 


c — (b—a)” 


= (b+a)"—(b- a)" 


4ab ((b+a)”~? + (b+a)"4(b 
4ab (5) (b+a)"? 


IA 


= 2abnc"-?. 


Since 


it follows that 


rad(A,ByCn) 


Therefore, 


lograd(An,BnCn) < 


= nloge 


II 
a 
| 


IA 
oS 
Re 
| 
ea, 
— 


< (ee) G2. Salada) 


Corecpe aaa| 


Equivalently, 


®.(a, b, c) < 


ay? +++. +(b 


Q 


< rad(b—a)rad (=) rad(abc) 


2 (bg) (=) rad(abe) 


(b — a) (2nc”~”) rad(abc) 
2nc"~trad(abc). 


(n — 1) loge + log rad(abc) + log 2n 


Eloge ©,(a,b,c) 


® b 
n E 


®.(a, b, c) 
l+e 


os) (log C;, + nlog 2) 


+ 2nlogn. 
me nlogn 


(l+e)n 
ae | log rad( An BnCn) 


+ 2(1 + e)nlog2 


A 


I 


2 (toe Cy - 


2 (log Cy, — (1 +e’) log rad(An BnC,)) + 4n log 2, 


(l+e)n 
——-~__+— _}] A,B 4n log 2 
(a ograd(A,B,C,) ) + 4nlog 
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where 


Since 
log C, — (1+ €’) lograd(A, BnCy) = ®2(An, Bn, Cn) < K*(e’,m), 
it follows that 
®.(a,b,c) < 2K*(e',m) + 4y(m) log 2. 


Thus, for every ¢ > 0, the function ®,(a, b,c) is bounded above, and this 
is equivalent to the abc conjecture. This completes the proof. 


Exercises 


1. Let a,b,c positive integers such that (a,b,c) = 1 and a+b = c. Prove 
that (a,b) = (a,c) = (b,c) = 1. Prove that a = b only if a = 1 and 
eS; 


2. Let a,b,c be relatively prime positive integers such that c is odd, 
a<b<c, and 
a+b=c. 


For every positive integer n, define 


A, = (b-a)", 
B, = c”-(b-a)", 
Cy -S 6". 


Prove that A,, B,, and C, are distinct, relatively prime positive 
integers such that 
An + By = Ch. 


3. Let a, b, and c be relatively prime positive integers such that c is 
even, a<b<c, and 
a+b=c. 


For every positive integer n, define 


ea 


2 
3 
I 


2 
II 
ae 
ey 
ae 

3 
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Prove that An, Bn, and Cy, are distinct, relatively prime positive 
integers such that 
A,+Bn,=Ch.- 


4. Let a,b,c be relatively prime integers such that a+ b= c. Prove if ¢ 
is even, then exactly one of the integers c and b — a is divisible by 4. 


5. Prove that if n is even, then 


(b+a)"—(b—a)” = 4ab ((b + a + (b+ a)”~*(b - a)? +--+ (b- ar) ; 


5.6 Notes 


One of the most fruitful analogies in mathematics is that be- 
tween the integers Z and the ring of polynomials F[t] over a 
field F. 


S. Lang [89, p. 196] 


There are beautiful survey articles on the abc conjecture by Lang, “Old 
and new conjectured diophantine inequalities” [88], Nitaj, “La conjecture 
abe’ [113], and Brzeziriski, “The abc-conjecture” [15]. Part of Lang’s article 
appears in his Algebra [89, pages 194-200], which is a highly recommended 
reference for all matters algebraical. 

The abc conjecture was motivated in part by Mason’s theorem, which is 
a polynomial analogue of the abc conjecture (see Mason [97]), and in part 
by a conjecture of Szpiro on the discriminants of elliptic curves (Lang [88]). 
According to Oesterlé [114, pp. 167-169], Szpiro had discussed this conjec- 
ture in a lecture in Hanover in 1983; the abc conjecture arose in a discussion 
between Masser and Oesterlé in 1985. 

Browkin and Brzezinski [14] contains considerable data on the values of 
the function L(a, b), discussed in Exercises (9)—(12), as well as a conjectured 
generalization of the abc conjecture to equations of the form a, +ag+---+ 
a, = 0. The proof that the congruence abc conjecture implies the abc 
conjecture is due to Ellenberg [27]. 

Fermat’s last theorem was proved by Taylor and Wiles [139, 156] in 
1995. For a different proof of Fermat’s last theorem for polynomials, see 
Greenleaf [41]. For a proof that the Catalan equation has no solution in 
polynomials or rational functions, see Nathanson [102]. 

V. A. Lebesgue [91] proved that the diophantine equation 2” = y? +1 
has no solution in positive integers. Chao Ko [82] proved that the only 
solution of x? = y™ + 1 in positive integers is 7 =m = 3 and y = 2. 

Silverman [134] applied the abc conjecture to Wieferich primes (Theo- 
rem 5.11). Wieferich [155] proved that if p is an odd prime such that the 
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Fermat equation 
xP + y? = 2P 
has a solution in integers x,y,z with (p, yz) = 1, then 
2?-'=1 (mod p’). 


Computations [17] suggest that such primes are rare, and that “most” 
primes are Wieferich primes. Indeed, 1093 and 3511 are the only primes p < 
4-10? that are not Wieferich primes. It is an open problem to determine 
whether there exists a prime p that satisfies the following two congruences: 


2?-'=1 (mod p”) 


and 
3?-'=1 (mod p?). 


Part II 


Divisors and Primes in 
Multiplicative Number 
Theory 


6 


Arithmetic Functions 


6.1 The Ring of Arithmetic Functions 


An arithmetic function is a complex-valued function whose domain is the 
set of positive integers. For example, the divisor function d(n) and the Euler 
phi function y(n) are arithmetic functions. 

The pointwise sum f + g of the arithmetic functions f and g is defined 
by 

(f+ 9)(n) = f(r) + g(n). (6.1) 

There are two natural ways to multiply arithmetic functions f and g. The 
first is the pointwise product f - g, defined by 


f-g(n) = f(n)g(n). 
The second is the Dirichlet convolution f * g, defined by 


(f *g)(n = 21 g(n/d)= S~ f(dg (6.2) 


dd/=n 


where the sum is over all positive divisors d of n. Dirichlet convolution 
occurs frequently in multiplicative problems in elementary number theory. 
We define the arithmetic function 6(n) b 


1 ifn=1, 
a(n) = 0 ifn>2, 


and the zero function 0(n) by 0(n) = 0 for all n. 
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Theorem 6.1 The set of all complex-valued arithmetic functions, with ad- 
dition defined by pointwise sum and multiplication defined by Dirichlet con- 
volution, is a commutative ring with additive identity O(n) and multiplica- 
tive identity 6(n). 


Proof. It is easy to check that the set of arithmetic functions is an 
additive abelian group with the zero function as the additive identity. 

We shall prove that Dirichlet convolution is commutative, associative, 
and distributes over addition, that is, 


feg=ag*f, 
(f*g)*h=fx*(g*h), 


and 
fe(gth)=fegtfeh 


for all arithmetic functions f,g, and h. These are straightforward calcula- 
tions. We have 


fxg(n ne g(n/d) = 5° g(n/d) f(a) = S— g(a) f(n/d) = g * f(n) 


d|n d\n 
and 
((F*g)*hy(n) = S(f*9)(@n (5) 
dln 
= Do (f*9)\(@h(m) 
dm=n 
= aD tl 
dm=n kl=d 
ae en ) 
kkm=n 
= Sof) SS g(Oh(m) 
k|n lm=n/k 
= Se) YS sOr(S) 
k|n L\(n/k) 
= Seah ign) (4 ) 
k|n 
= (f*(g*h))(n). 
Similarly, 


fe(gth)(n) = d_f0 (g(n/d) + h(n/d)) 
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= Lie g(n/d) + S~ f(d)h(n/d) 


d|n 


ee )+ f * h(n). 


I 


Finally, we observe that 


= 20 f(n/d) = f(n) 
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for every arithmetic function f, and so the arithmetic functions form a 
commutative ring with multiplicative identity 6(n). This completes the 


proof. 


Recall that a derivation on a ring R is an additive homomorphism D : 


R— R such that 
D(xy) = D(x)y + xD(y) 
for all x,y € R. 


Theorem 6.2 Consider the arithmetic function L(n) defined by 


L(n)=logn for alln > 1. 


Pointwise multiplication by L(n) is a derivation on the ring of arithmetic 


functions. 
Proof. Observe that if d is a positive divisor of n, then 
L(n) = L(d) + L(n/d). 
We must prove that 
bef haf) sors * eg) 
for all arithmetic functions f and g. We have 


L-(f*g)(n) = L(n)S° f(dg(n/d) 


d|n 


= 2 Hn) g(n/d) 
= es + L(n/d)) f(d)g(n/d) 


d|n 


= DAE g(n/d) + 9° f(a) L(n/d)g 


d\n 
= cL eeied: 
This completes the proof. 


(n/d) 
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Exercises 

1. Define the arithmetic function 1(n) by 1(n) = 1 for all n. Prove that 
1*1(n) = d(n). 

2. For every positive integer k, let d,(m) denote the number of k-tuples 
of positive integers (a1,d2,...,@,) such that n = a,a2---ax. Prove 
that 

dy(n) =1l*1*--+-* 1(n). 
k times 

3. Let f and g be arithmetic functions. Prove that f * g = 0 if and only 
if f =0 or g = O. It follows that the ring of arithmetic functions is 
an integral domain. 

4. Let A be the ring of complex-valued arithmetic functions. An arith- 
metic function f is called a unit in A if there exists an arithmetic 
function g such that f *g = 6. Prove that f € Ais a unit if and only 
if f(1) £0. 

5. For every positive integer NV, let Iy be the set of all arithmetic func- 
tions f(n) such that f(n) = 0 for all n < N. Prove that Iy is an 
ideal in the ring of arithmetic functions. 

6. Let f and g be arithmetic functions. Prove that 

“(n 
L” - EP EP eT hg. 
G0=)>, ( : fxL'g 
k=0 
7. Let 7 be the additive abelian semigroup consisting of all sequences 


J = {j;}S2, of nonnegative integers such that 7; = 0 for all sufficiently 
large 1. Addition of elements in 7 is defined coordinate-wise. 


Let t1,t2,... be an infinite sequence of variables. For every J © J we 
define the monomial 
es |e 
Ki2l 


If J is the sequence with j; = 0 for all i, then t? = 1. Let R be the 
set of all expressions of the form 


> ajt?, 
JET 


where the coefficients a; are complex numbers. We define the sum 
and product of elements of R by 


ys ast? + DD bjt? = S- (az + by)t? 


JET JET JET 


10. 
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and 


&s eat) (x bat?) = S- age 


JET J2ET J, J2ET 


Prove that R is an integral domain, that is, a commutative ring with 
no zero divisors. 


Remark. This ring is called the ring of formal power series in in- 
finitely many variables t, tg... with coefficients in C. It is denoted 
by Cl[t1, ta, oe J]. 


. Let P = {p1, po, p3,...} be the sequence of primes in ascending order, 


that is, p) = 2,p2 = 3,p3 = 5,... . By the fundamental theorem of 
arithmetic, to every positive integer n we can associate a sequence 
Jn, € J as follows: If 
Co 
n= [[2" a 
i=1 


then 
In = {Up (M) }21- 
Prove that this is a bijection between N and 7. 


. Let A be the ring of complex-valued arithmetic functions. For every 


arithmetic function f € A we define the formal power series 


&(f) = SO f(njt” € Ctr, te... J, 


neN 


where J, € J is the sequence constructed in Exercise 8. Prove that 
the map 
@:A— C|[t1, te, oe J] 


is a ring isomorphism. 


Remark. Since the ring of formal power series in infinitely many vari- 
ables is a unique factorization domain, it follows that the ring of 
complex-valued arithmetic functions is also a unique factorization 
domain. 


For arithmetic functions f and g, define the product f * g by 


n-1 


f xg(n) = >> f(k)g(n— k). 


k=1 


Is this product commutative? Is it associative? What is f x 6? 
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6.2 Mean Values of Arithmetic Functions 


We define the mean value F(x) of an arithmetic function f(n) by 


= 5° f(n) 


n<ux 


where the sum is over all positive integers n < x. In particular, F(a) = 0 
for « < 1. The function F(x) is also called the sum function of f. We 
shall describe two simple but powerful tools for estimating sum functions in 
number theory. The first is integration and the second is partial summation. 

The integer part of the real number x, denoted by [2], is the unique 
integer n such that n < « < n+ 1. The fractional part of x is the real 
number {x} = x — [2] € [0,1). For example, [—32] = —2 and {—3} = §. 
Every real number «x can be written uniquely in the form x = [a] + {z}. 

A function f(t) is unimodal on an interval I if there exists a number 
to € I such that f(t) is increasing for t < to and decreasing for t > to. For 
example, the function f(t) = log* t/t is unimodal on the interval [1, 00) 
with to = e*. 

It is proved in real analysis that every function that is monotonic or 
unimodal on a closed interval [a, b] is integrable. 


Theorem 6.3 Leta and b be integers with a < b, and let f(t) be a function 
that is monotonic on the interval [a,b]. Then 


b b 
min(f(a), f()) < So f(n) -f f(t)dt < max(f(a), f(0)). (6.3) 


n=a 


Let x and y be real numbers with y < [a], and let f(t) be a nonnegative 
monotonic function on ly, x]. Then 


> fn or ae f(t)at| < max(f(y), f(2)). (6.4) 


Yy<n<ar 


If f(t) is a nonnegative unimodal function on [1,co), then 


Pa= n= fo f(t)dt + O(1). (6.5) 


»u 


Proof. If f(t) is increasing on [n,n + 1], then 


n+1 
f(n) < / f(t)dt < f(n +). 


If f(€) is increasing on the interval [a,b], then 


a+ f 1 yat < 3 F(n) ) < f(b) + [so dt. 


n=a 
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Similarly, if f(t) is decreasing on the interval [n,n + 1], then 
n+1 
fins tvs fo f@dt< fn), 
nm 


If f() is decreasing on the interval [a, b], then 


b b b 
f00)+ f soars Ys) < slay + f poate 


This proves (6.3). 
Let f(t) be nonnegative and monotonic on the interval [y, a]. Let a = 
[y] + 1 and b = [2]. We have y<a<b< a. If f(t) is increasing, then 


So = So Fo) 


Y<nce a<n<b 
b 
< J soa £0) 
< fo foaes se. 
Since 2. 
f(a) > / f(Odt 
and . 
f(e) > | f(t)dt, 


it follows that 


b 
> f(n) > / f(t)dt + f(a) 


Yy<n<ca 


IV 


[roa [roars - f° soe 


IV 


ff teoae— ra), 
y 
Therefore, 


Sf (a): 


Y sm " f(t)dt 


y<n<ax 


If f(t) is decreasing, then 
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b 
< [fod Fo 
< f f@de+ flu) 
Since . 
f(b) = : f(t)dt 
and 


it follows that 


b 
S> f(n) > ‘ f(t)dt + f(b) 


> f soars s0)- | “Fae f° spat 
> jf foae— su) 


and 
ES sey fsa < sy) 


Y<n<an 


This proves (6.4). 
If the function f(t) is nonnegative and unimodal on [1,0o), then f(t) is 
bounded and (6.5) follows from (6.4). 


Theorem 6.4 For x > 2, 
SS, logn = log —x+ O(log). 
n<ux 


Proof. The function f(t) = log ¢t is increasing on [1,2]. By Theorem 6.3, 


J ostat < Sogn < f log tdt + log x, 
1 1 


nN<ax 


and so 
S- logn = xlogx —x+ O(log z). 


n<u 


This completes the proof. 
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Theorem 6.5 Let r be a nonnegative integer. For x > 1, 


log" n 1 ay 
y = ray og’ «+ O(1), 


n<ux 
where the implied constant depends only on r. 


Proof. The function f(t) = log” t/t is nonnegative and unimodal on 
[1,co) with maximum value (r/e)” at tp = e”. By Theorem 6.3, 


log” n ” log” tdt 1 ach 
) = i ; + O(1) 741 8 z+0O(1) 


nN<2x 


This completes the proof. 


Theorem 6.6 Let k be a nonnegative integer. For x > 1, 


log* (x/n 1 
S- g'(a/n) _ 


= l k+1 l k 
ee oe z+ O(log” x), 


n<ux 
where the implied constant depends only on k. 
Proof. The idea is to expand log*(a/n) by the binomial theorem and 
apply Theorem 6.5. We have 


log®(x/n log x — logn)* 
yy EE: 2, See gn) 


nm n 
nsx n<a 


( 
= $ (*) a log**2 2 4.0 pe (*) log*-? : 


r=0 


= ee log**! « + O(log* a), 


since 
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by Exercise 8. 


Theorem 6.7 Let k be a positive integer. Then 


1 1 
Ny NE<U , 


where ee denotes the sum over all k-tuples of positive integers 
(n1,.--, 7%) such that ny-+-np <a. 


Proof. By induction on k. For k = 1, we set r = 0 in Theorem 6.5 and 
obtain 


1 
S- — =logr+O(1). 
ny 


n1<x 


Assume that the result holds for the positive integer &. Then 


se 1 


nr eee n nr 
ieee kNR41 


- Yo FY > 


nr 
Ne-1 Se pire Ny Nh a/Ne4L 


= (Files Ce/rnsn) + O(tou!"e/nes2))) 


Nk+1 k! 


Ne-1 Se 


1 k 
a S- Kinga (log x — log nx+1) 


Nk+1 Sz 
1 
+ O | log**a 
Net1 Se Meta 
1 
= Ss in (log « — logn)* + O (log* x) : 
oan Bin 


We use the binomial theorem and Theorem 6.5 to compute the main term. 


k 


k 
(-1)" ( ) log*~" x log” n 
(a 


| 
M 
=| 4 


1 
x Ela (logx —logn)* = 


n<a n<a«  =r=0 
k 
(<1) /B\ ger x log" 
=> ] 


= FN agra (Latest 00) 
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k 


= a log! ay = (*) +O (log x) 


r=0 


= — log*t!r+0 (log x) ; 


by Exercise 8. 


Theorem 6.8 (Partial summation) Let f(n) and g(n) be arithmetic func- 
tions. Consider the sum function 


F(x) = 0 f(n). 


Let a and b be nonnegative integers with a < b. Then 


b 
do f(m)g(n)_ = F(b)g(b) — F(a) g(a + 1) 


n=at+l1 


— So F(n)(g(n +1) — g(n)). (6.6) 


n=at+l1 


Let x and y be nonnegative real numbers with [y| < [a], and let g(t) be a 
function with a continuous derivative on the interval [y, x]. Then 


do f(r)g(n) = F(a) g(a) — Fol) -f F(t)g'(t)dt. (6.7) 
Yy<n<ca 7] 

In particular, if x > 2 and g(t) is continuously differentiable on [1, x], then 
de f(n)g(n) = F(a) 9(c) -| F(t)g'(t)dt. (6.8) 
n<ux 1 

Proof. Identity (6.6) is a straightforward calculation: 


b 
dS F(rg(n) 


n=at+l1 
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If the function g(t) is continuously differentiable on [y, xz], then 
nt+1 
gint1)—9(n)= fg (oat. 
Since F(t) = F(n) forn<t<n+1, it follows that 
n+1 
F(n) (g(n+1)~g(n)) = f F(eg' (be 


Let a = [y] and b= [a]. Sncea<y<a+1<b<2<b+1, we have 


do f(n)g(n) 


y<n<ax 


b 
= > f(n)g(n) 


n=atl1 
b-1 
= F(d)g(b) - Fl@g(a+1)- Yo Fln)(g(n +1) — g(n)) 
oe n+1 
= F(2)9(b) -Fy)ga+)- So | F(t)g'(t)at 


This proves (6.7). 
If « > 2 and g(t) is continuously differentiable on [1, x], then 


S~ F(n)g(n) 


n<ux l<n<a 


= f)gQ) + F(x)g(@) — Fg) - i F(t)g'(t)dt 


I 
Ss 
F ae 
os 
sS 
as, 

ee 
Sat 
Sb 
, 
= 
SS 
: eat 

3 
~ 


= Flog(a)— f Feg'(oae 


This proves (6.8). 


Letting r = 0 in Theorem 6.5, we obtain 5°, -,, 1/n = log x+O(1). Using 
partial summation, we can obtain a more precise result. 
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Theorem 6.9 For x > 1, 


1 
So 4 = loge +7 + r(e), 


n<ux 
where 1} 
OS A 
and 
In(2)] < + 
= 


The number y = 0.577... is called Euler’s constant. A famous unsolved 
problem in number theory is to determine whether ¥ is rational or irra- 
tional. 

Proof. Since 0 < {t} < 1 for all t, we have 


and so 7 € (0,1). 


We apply partial summation to the functions f(m) = 1 and g(t) = 1/t. 
Then F'():=')" 2; 1 = [4] and 


= = YK Flerain) 

By fe 
oie Pe f es 
= teal I har) [ at - 
= logr+y4tr(a 


where a3 
8 gy 2h 
t2 


x 


r(x) = 


Moreover, |r(x)| < 1/a since 0 < {a}/a < 1 and 
POAT ican | 1 
0< Gat < f dt = —. 
ha 60 x 
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Theorem 6.10 Let A = {a;}S2, be an infinite set of positive integers with 
ay < a2 < a3 <c+e. If 


Ate) = )1=0/ as ) 


2 
ae log x 


for x > 2, then the series 


ey) 


converges. 
Proof. Let x(n) be the characteristic function of A, that is, 


_f 1 ifneA, 
XM =) 0 tng A 


There exists a number c such that 


for all a > 2, and A(x) < 1 for 1 < a < 2. Applying partial summation, we 
obtain 


ee xa(n) 
ai<a ” n<a a 
= A(z) ce i A(t)dt 
x 1 
2 ae ae an dt 
~ log?a 2 2 tlog?t 
ce. Sons 2 108 dy 
log? * 2 tefl we 
< Ow. 


This completes the proof. 


Theorem 6.11 For x > 2, 


S¢ log’ n = x log? x — 2x log x + 2x + O (log? x) . 


n<u 
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Proof. We use partial summation with f(n) = 1 and g(t) = log? t. Then 
F(t) = [¢] and g'(t) = 2logt/t. Then 


* [t] log t 
Slog? n = ix] log? — 2 f f] log dt 
1 


t 
= («— {x}) log? —2 | EOS 


se ” {tllogt 
= «log? x + O(log? s)-2 | logtat +2 f Hest a 
1 1 


n<ux 


= slog’ — 2xrloga + 2x + O(log? 2). 


This completes the proof. 


Theorem 6.12 For x > 2, 


iy log? - = 2x + O (log? zx). 


n<ux 


Proof. From Theorem 6.4 and Theorem 6.11, we obtain 


S- log? - YS (log « — log n)? 


I 


n<ax n<a 
a S- (log? « — 2 log «log n + log? n) 
n<ax 
= [x] log” x — 2loga S~ logn + Slog? n 


nsx nx 
= «log? x—2log2(xloga — x) + xlog* x — 2x loga + 2x + O (log? zx) 
2x + O (log? zx). 


This completes the proof. 


Exercises 
1. Prove that 
n\r n\n? 
e(*) <n <en(*) : 
€ € 
Hint: Use partial summation to estimate log nl. 


2. Let f(n) be an arithmetic function such that 


F(«) = SU f(n) = O(@). 


n<ux 
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Prove that 


. Prove that 


1 1/2 | ~ {t} 1/2 


n<x 


. For 0 <a< 1, let 


Prove that 


. Prove that 


S- log’ n = «log* x + O(xlog*~! x) 


n<u 


for all positive integers k. 


. Prove that 


SZ log Sega O(log x). 
n 


nN<ux 


. Prove that 


ys log” * = kle + O(log” x) 
n 


nN<ax 


for all positive integers k. 


. Prove that for every nonnegative integer k, 


be 


A (-l" (k\ 1 
Pes r) k+l 


. Prove that for every positive integer J, 


j+1 
1 


7 

. J 
y hee 
n=1 ins 


+ O(r*). 
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10. Let a,b and k be positive integers, with a < b and k > 2. Prove that 


yee G-2)+0(3). 


Prove that 


11. Prove that 


1 
———. = O(log] ; 
Pa eer lst Oleg log) 


6.3. The Mobius Function 


The Mobius function ju(n) is defined as follows: 


1 ifn=1, 
u(n) = ¢ (—-1)* if nis the product of k distinct primes, 
0 if n is divisible by the square of a prime. 
We have 
w(4) = 0, w9) = 9, 
1(5) ils (10) = 1 


An integer is called square-free if it is not divisible by the square of a prime. 
Thus, y(n) 4 0 if and only if n is square-free. 

Recall that an arithmetic function f(n) is multiplicative if f(mn) = 
f(m) f(m) whenever (m,n) = 1. 


Theorem 6.13 The Mobius function p(n) is multiplicative, and 
1 ifn=1, 
Ta =| ‘ieee (6.9) 


Proof. Multiplicativity follows immediately from the definition of the 
Mobius function, since if m and n are relatively prime square-free integers 
with k and @ prime factors, respectively, then mn is square-free with k + @ 
factors, and 
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Next we prove the convolution formula (6.9). If n = 1, then 

SiG) Su) =, 

d\n 
For n > 2, let 

m= plt--- pr 
be the standard factorization of the integer n. Then r > 1. Recall that the 
radical of n is the largest square-free divisor of n, that is, 
rad(n) = p++ Pp 


is the product of the distinct primes dividing n. Let m = rad(n). If d divides 
n and p(d) 4 0, then d is square-free, and so d divides m. Since m is the 
product of k primes, it follows that there are exactly (*) divisors of m that 
can be written as the product of i distinct primes, that is, the number of 
divisors d of m such that w(d) = i is eas Therefore, 


So u(d) = So u(d) 


d|n d|m 


This completes the proof. 
We defined the arithmetic function 1(n) by 1(m) = 1 for all n. Using the 
Dirichlet convolution, we can restate Theorem 6.13 as follows: 


pxl=6, 
and so the Mobius function ps is a unit with inverse 1. 


Theorem 6.14 (M6bius inversion) [f f is any arithmetic function, and 
g is the arithmetic function defined by 


g(n) = 5> f(d), 
d\n 
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then 


f(n) = Sn (4) 9(a). 
d|n 


Similarly, if g is any arithmetic function, and f is the arithmetic function 
defined by 
n 
f(n) = dou (=) g(d), 
dln 


then 


g(n) = S> f(d). 
| 


Proof. We use Theorem 6.13 and the commutativity and associativity 
of Dirichlet convolution. The definition 


g(n) = S> f(d) 
d| 


is equivalent to 
g=f*l. 
Then 
g*w=(fel)euHfxeep)=fx6=f. 
Similarly, if 
f=9*u, 
then 
fel=(g*p)*l=g* (url) =grd=g. 


This completes the proof. 


The following result gives a useful identity for sum functions of arithmetic 
functions. The proof can be described geometrically as a sum over the 
lattice points (m,d) under the hyperbola v = a/u in the positive quadrant 
of the uv-plane. 


Theorem 6.15 Let f(n) be an arithmetic function and 


F(x) = 0 f(n). 


n<u 


Then 
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Proof. We have 


M 
zy 
——~ 
SIs 
II 


yd fO= 10 


m<ax m<ad<a/m dm<a 
= Yi@ Y 1=Y [5]. 
d<a m<a/d dr 
= Ses f(d). 
n<x din 


Also, 


~ F(=)= ¥ f@=L Ye. 


m<au dm<a n<a din 


This completes the proof. 


Theorem 6.16 


we obtain 
(=) =a [2] =o Va =1, 
m<a d<a n&x d\n 
by Theorem 6.13. Since 
a BE Dae rm {2}. a +0(e), 


it follows that 


d<a 
Therefore, 
d 
x S- ma) = O(a), 
d<a 
and so 


6.3 The Mobius Function 


This completes the proof. 


Theorem 6.17 


ln) a =+o(=). 


Proof. The Riemann zeta function 


n<x 


n=1 
converges absolutely for s > 1. Similarly, the function 


G(s) = — (12) 


ns 
n=1 


converges absolutely for s > 1. Therefore, 


(se = YE M 


by Theorem 6.13, and so 


n=1 
for s > 1. Since 
Se nr? 
2 = SS 
¢(2) a aoe 
it follows that 
ie = 2 un) 6 
CQ) 2a 
and so 
(nr) 6 | A) Lg 
n2 Te] n? S n2 = x 
n<x n>x n>2x 


This completes the proof. 
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Exercises 
1. Compute p(n) for 11 < n < 30. 


2. Let f(n) be an arithmetic function, and define g(n) = })4),, f(d). Use 
Mobius inversion to write f(30) as a sum and difference of values of 
the arithmetic function g. 


3. Let d(n) be the divisor function. Prove that 
n 
d(k)u(*) =1 
So dk) ; 
k|n 


for every positive integer n. 
Hint: Problem 1 in Section 6.1. 


4. Let o(n) denote the sum of the positive divisors of n, that is, 


a(n) = > k. 


k|n 


Seth (B)=n 


k|n 


Prove that 


for every positive integer n. 


5. Let f(a) be a function on the set of real numbers x > 1. Define the 


function g(x) by 
g(a) = > f (=). 


Prove that 


x 
f(z) = 7 nln)g (=). 
n<x 
6. Let g(x) be a function on the set of real numbers x > 1. Define the 


function f(x) by 
f(x) = 2 wln)g (=). 


n<x 


ge) = >> 5 (=). 


nN<ax 


Prove that 


7. Let a > 0. Let f(x) be a function on the set of real numbers x > 1. 
Define the function g(x) by 


10. 


11. 
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Prove that 


fay= Ee (e): 


n<al/e 


. Let a > 0. Let g(x) be a function on the set of real numbers x > 1. 


Define the function f(x) by 


fa= yy Mee (). 


Prove that 


. Prove that every positive integer n can be written uniquely in the 


form n = k?, where k and @ are positive integers and @ is square- 


free. Prove that 
w?(n) = S- ud). 


d?|n 


Prove that the density of the square-free integers is 6/7?. Equiva- 
lently, let Q(x) denote the number of square-free integers not exceed- 
ing x. Prove that 


lim Ae 


@Z0o 6 2 


Hint: n is square-free if and only if u?(n) = 1, and 


Qa) = Tem) =X wa [SZ] =S + 0a). 


n<a d?#<a 


Define the von Mangoldt function 


_ J logp if n = p® is a prime power, 
MO) { 0 otherwise. 
Let 
L(n) = log n. 
Prove that 
L=1*A 
and 


A(n) = — S> p(d) log d. 
d|n 


224 6. Arithmetic Functions 
6.4 Multiplicative Functions 


In this section we prove some general properties about multiplicative arith- 
metic functions. 


Theorem 6.18 If f is a multiplicative function, then 
f({m, n}) fm, n)) = fm) f(n) 


for all positive integers m and n. 


Proof. Let p;,...,p, be the prime numbers that divide m or n. Then 


Tr 
= ki 
t=1 
and 
+ 
= a 
m= P; ’ 
t=1 


where ky,...,k,,€1,...,€ are nonnegative integers. Then 
TZ pymax(hi sls) 
ki bi 
[m,n] = |] pm 
i=1 


and E 
in(ki,l:) 
(mn) =[[or"?™. 
w=1 


Since 
{max(k;, €;),min(k;, ¢;)} = {ki, Ci} 


and since f is multiplicative, it follows that 


flim, nl) f(Gnjn)) = II 5 (pinot) Il 5 (pmintse) 


= Tro) Ire 
= f(m)f(n). 


This completes the proof. 


Theorem 6.19 Let f be a multiplicative function with f(1) =1. Then 


S/ u(d) f(a) = [[G — f(p)). 
d\n 


pin 
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Proof. The identity holds for n = 1. For n > 2, let m = rad(n) be 
the product of the distinct primes dividing n. Since y(d) = 0 if d is not 
square-free, it follows that 


SS" ud) f(a) = S u(a) f(@) = [A - fe) = [[G - Fo). 
d\n 


d\|m p|m pin 


This completes the proof. 


The sequence of prime powers is the sequence 
2,3,4,5, 7, 8,9, 11,13, 16,17,19, 23,25,27,.... 
The smallest power that is not a prime power is 36. 


Theorem 6.20 Let f(n) be a multiplicative function. If 


lim f(p") =0 


p* 00 
as p® runs through the sequence of all prime powers, then 


lim f(n) =0. 


n— Co 


Proof. Since lim,«_,.. f(p*) = 0, it follows that there exist only finitely 
many prime powers p* such that |f(p")| > 1, and so we can define 


A= |] If"). 
L@|>1 


Then A > 1. 
Let 0 <e <1. There exist only finitely many prime powers p* such that 
|f(p")| = e/A, and so there are only finitely many integers n such that 


€ 
FON = 5 


for every prime power p* that exactly divides n. Therefore, if n is sufficiently 
large, then n is divisible by at least one prime power p* such that | f(p*)| < 
e/A, and so n can be written in the form 


r+s r+st+t 


Tr 
n=[][o" [To IL ee. 
i=1 


i=rtl i=rt+s4+1 


where pj,...,Pr+s+z are distinct prime numbers such that 


lf (wi 


>1 for GH Ves 0h 1% 
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E. ky : 
q SNF: <1 fori=r+1,...,r+s, 
E 
| f (pe se fori=r+s+1,...,r+s+t, 
and 
t>1. 
Since f is multiplicative, 
is r+s r+st+t 
ki k, ki 
fl = [Plf@r) TT #@M TI lt@®)| < Ae/A4)! <e. 
i=l i=r+l i=r+st+1 


This completes the proof. 


Exercises 


1. 


Let f be a multiplicative function. Prove that if f(1) = 0, then f is 
identically equal to 0, that is, f(n) = 0 for all n. Prove that if f is 
not identically equal to 0, then f(1) = 1. 


Prove that a multiplicative function is completely determined by its 
values on prime powers p*. 


Prove that if f and g are multiplicative functions, then f * g is also 
multiplicative. 


. Define the arithmetic functions w(n) and Q(n) as follows: If 


k kp 
N= py '+*-p, 


is the standard factorization of the positive integer n, then 
w(n) =r 
is the number of distinct prime divisors of n, and 
O(n) = ky +--+ +k, 


is the total number of prime factors of n. Prove that n is square-free if 
and only if w(n) = Q(n). Prove that the arithmetic function (—1)“(” 
is multiplicative. 


An arithmetic function f is called completely multiplicative if f(mn) = 
f(m)f(n) for all positive integers m and n. Prove that Liouville’s 
function 
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is completely multiplicative. Prove that 
S>A(d) =: { 1 if nis a square, 
d| 


0 otherwise. 


6. Prove that for every 6 > 0, 


lim 


=o. 
NCO mi-é 


Hint: Apply Theorem 6.20 to the multiplicative function f(n) = 
n'~°/y(n). Observe that 


pk(1—8) Z 9 
Pls pe 2) - ye 


) ( ) 
| | 1-—+)>2 1-—] >=. 
2) 2 
p|n ( P k=2 k : 
Hint: Consider the identity 


0< 


7. Prove that 


5 < <1 
2 n? 
Hint: Observe that for every prime power p*, 
ep)o)_, 1 .,_1 
per =] pett = pe 


9. Prove that 
n<a(n)<«n'*? 


for every 6 > 0. 
Hint: Apply Exercise 6 and Exercise 8. 


6.5 The mean value of the Euler Phi Function 


The Euler phi function is 
g(n) =n] | Gas =n = Ss d'u(d). (6.10) 
p|n dln d’/d=n 


We shall find an asymptotic formula for the mean value of the Euler phi 
function. 
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Theorem 6.21 For x > 1, 
3a? 
&(2) = So y(n) = —z + O(elogz). 


Proof. We have 


®(z) = DS) on) 
= SY du 

n<a d’d=n 
= Su@ Sd 
d<a d’<a/d 


2 a d 
d<a d<a 

x? SS p(d) 2? x pd) 

5 > D 5 p + O (x log x) 
#=1 aoe 
2 

= 52” + O(wloga). 
uy 


This completes the proof. 


Theorem 6.22 The probability that two positive integers are relatively 
prime is 6/1. 


Proof. Let N > 1. The number of ordered pairs of positive integers 
(m,n) such that 1<m<n<N is N+(3) = N(N +1)/2. The number 
of positive integers m < n that are relatively prime is y(n), and so the 
number of pairs of positive integers (m,n) such that 1 <m<n< N and 
m and n are relatively prime is 

3N? 
S> y(n) = —z + O(NlogN). 
n<N 
Therefore, the frequency of relatively prime pairs of positive integers not 
exceeding N is 


N° +O(NlogN) 6 o (8) 6 


N(N+1/2... N 
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as N — oo. This completes the proof. 


Exercises 


1. Use Mobius inversion to prove identity (6.10): 


y(n) = n> _ 
d|n 


2. Prove that 
lim sup a) = 1, 
n 


n—0o 


Hint: Consider y(n) for n = p prime. 


6.6 Notes 


Everything in this chapter is classical number theory. For other elementary 
results on arithmetic functions, see Hardy and Wright [60]. 

There is a vast literature on the distribution of values of arithmetic func- 
tions. For a comprehensive survey of this field, see Elliott, Probabilistic 
Number Theory I, II [28, 29]. 
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Divisor Functions 


7.1 Divisors and Factorizations 


The divisor function d(n) counts the number of positive divisors of n. Thus, 


dil) = 1, d(6) = 4, 
d(2) = 2, d(7) = 2, 
d(3) = 2, d(8) = 4, 
d(4) = 3, d(9) = 38, 
d(5) = 2, d(10) = 4. 


1 
We can write down an explicit formula for d(n) in terms of the prime powers 
that exactly divide n. Let 
n= [x : 


pin 
Every divisor d of n is of the form 
d= |p, 
pln 
where a, is an integer such that 
0 < ap < Up(n). 
Since each exponent a, can be chosen in v,(n) + 1 ways, it follows that 


d(n) = | [(wp(n) +1). 


p\n 
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Theorem 7.1 The divisor function d(n) is multiplicative. 


Proof. Let m and n be relatively prime integers, 


m= lle” 


plm 


n= IIe. 


q|n 


and 


Since (m,n) = 1, the set of primes that divide m and the set of primes that 
divide n are disjoint. Therefore, 


mn = II prl™) II gra) 
plm q|n 
is the standard factorization of mn, and 


d(mn) = T] (vp(m) + 2) T] (va(n) +1) = a(m)a(n). 


p|m q|n 


This completes the proof. 


Theorem 7.2 For every ¢ > 0, 
d(n) Ke n*. 


Proof. Let ¢ > 0. The function f(n) = d(n)/n* is multiplicative. There- 
fore, by Theorem 6.20, it suffices to prove that 


lim f(p*) =0 
pk—oo 
for every prime p. We observe that 


k+1 
Dke/2 


is bounded for k > 1, and so 


for = 


IA 
Fae 
et 
x) 4 
Oy 
Bat 
N] eS 
Ny 
Caan 
3 
ca 
gO) 
Shs, 
bo 
Se 


< 
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This completes the proof. 


Theorem 7.3 For x > 1, 


Dx) = S- d(n) = xlogx + (2y —1)a + O(V/2). 


n<ux 


The problem of estimating the sum function D(z) is called Dirichlet’s 
divisor problem. 

Proof. We can interpret the divisor function d(n) and the sum function 
D(x) geometrically. A lattice point in the plane is a point whose coordinates 
are integers. A positive lattice point in the plane is a point whose coordinates 
are positive integers. In the uv-plane, 


d(n)=S°1= S01 
d\n 


N=UV 


counts the number of lattice points (u,v) on the rectangular hyperbola 
uv = n that lie in the quadrant wu > 0,v > 0. The sum function D(z) 
counts the number of lattice points in this quadrant that lie on or under 
the hyperbola uv = x, that is, the number of positive lattice points (u, v) 
such that 1 <u<aand1<v< <2/u. These lattice points can be divided 
into three pairwise disjoint classes: 


(i) 
l<u<Ve and l<v<vz2, 


l<u<VJe and Va<v<z/u, 


Je<u<a2 and l<v<za/u. 


The third class consists of the lattice points (u,v) such that 
l<vu<JVe and Ve<u<z/v. 


It follows from Theorem 6.9 that 


Di) = [val+ > ([Z]-Wva)+ X ((-4) 


l<u<V/a l<v<s fa 


= [va+2 S ([F]-[va) 


l<u< Ja 
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- 2S (ar 


l<us ve 

= 2) €-({E})-Wwe-{vay)’ 
lsu ve 

= 2 --2 S- {=}-2+ O(va) 
lsu Ve l<usVve 

= Ww (108 vi ty 4 o(+)) x + O(V/z) 


= slogxr+(2y-1)r4+ O(V2). 


This completes the proof. 


Theorem 7.4 For «> 1, 


A(x) = > (logn — d(n) + 2y) =O (2¥/?) : 


n<ux 


Proof. By Theorem 7.3 we have 


S- d(n) = xlogxr+(2y-1)a+O ee) ; 


n<ux 


By Theorem 6.4 we have 


S¢ logn = rlogx — x + O(log zx). 


n<u 


Subtracting the first equation from the second, we obtain 


os (logn — d(n) +2y) =O (2?) — 2y{x} + O(log x) =O (x1?) 


n<ux 


An ordered factorization of the positive integer n into exactly @ factors 
is an ¢-tuple (di,...,d¢) such that n = d,--- dg. The divisor function d(n) 
counts the number of ordered factorizations of n into exactly two factors, 
since each factorization n = dd’ is completely determined by the first factor 
d. For every positive integer @, we define the arithmetic function de(n) as 
the number of factorizations of n into exactly ¢ factors. Then dj(n) = 1 
and d2(n) = d(n) for all n. 
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Theorem 7.5 For every ¢ > 1, the function de(n) is multiplicative, and 
a+€-1 
de(p*) = 
e(p") ( Pat ) 


Proof. Let (m,n) = 1. For every ordered factorization of mn into ¢ 
factors we can construct ordered factorizations of m and n into @ parts, as 
follows. If mn = d,---dg is an ordered factorization of mn into ¢ parts, 
then, by Exercise 20 in Section 1.4, for each i= 1,...,@ there exist unique 
integers e; and f; such that e; divides m, f; divides n, and d; = e; fi. 
Then m = e;---eg and n= f,--- fe are ordered factorizations of m and n, 
respectively. This construction is reversible, and so establishes a bijection 
between ordered factorizations of mn and pairs of ordered factorizations of 
m and n. It follows that de(mn) = de(m)de(n), and so the divisor function 
dg is multiplicative. 

An ordered factorization of the prime power p* can be written uniquely 
in the form p* = p!---p'¢, where (b1,...,6¢) is an ordered ¢-tuple of 
nonnegative integers such that b; + ---+ be = a. It follows that dg(p*) is 
exactly the number of ordered partitions of a into exactly @ nonnegative 
parts. Imagine a sequence of a+—1 red squares. If we choose ¢—1 of these 
squares and color them blue, then the remaining a red squares are divided 
into exactly @ subsequences (possibly empty) of consecutive red squares, 
separated by blue squares. Every ordered partition of a into @ nonnegative 
parts can be uniquely constructed in this way, and so dg(p*) is the number 
of ways to choose ¢— 1 squares from a set of a+ ¢—1 squares, that is, 


de(p"*) = (" ? 


for all prime powers p*. 


This completes the proof. 


Theorem 7.6 For é > 2, 


1 
(¢—1) 


De(x) = S > de(n) = 


nN<ax 


Vv log ta+O (x log’? x) : 


Proof. The proof is by induction on ¢. By Theorem 7.3, D(a) = x log x+ 
O(a). Now assume that the result holds for some integer £ > 2. The nota- 
tion }74,...q, Means a sum over all ordered ¢-tuples (d1,...,de) of positive 
integers. Applying Theorem 6.7, we obtain 


Dez) = So deyi(n) 


n<ux 
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ae 


n<x dy sdeqy =n 


= 


n<a dy---de|n 


= ae 


dy--de<x 
=e > a +O S- 1 
dy--dg<x dy--de<x 
£ 
= TE * +0 (clog! 2) + O(D¢(2)) 
1.2 
= —F = +0 (elog’“" x) 


This completes the proof. 


Exercises 


1. Compute d(n) for 11 <n < 20. 
2. Prove that n is prime if and only if d(n) = 2. 


3. Prove that d(n) is prime if and only if n = p?~!, where p and q are 
prime numbers. 


4. Prove that d(mn) < d(m)d(n) for all positive integers m and n. 


II d= nir/2. 


d|n 


5. Prove that 


6. Prove that 
S- d?(n) > clog? x. 


n<x 


Hint: Apply the Cauchy—Schwarz inequality to D?(z). 


Remark. In Theorem 7.8 we obtain an asymptotic formula for 7, <,, d?(n). 


7. Let w(n) denote the number of distinct prime divisors of n, and let 
Q(n) denote the total number of prime divisors of n. Prove that 


qu(n) < d(n) < g(r) _ 


Prove that d(n) = 2“(") if and only if n is square-free. 


10. 


11. 


12. 


13. 


14. 


15. 


(2 
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. Let 6 > 0 and x > e®. Prove that the number of positive integers 


n <x with d(n) > (log x)!*° is O(x(log x)~°*). 
Hint: D(x) = O(a log x). 


. Let r > 1 and a > e®. Prove that the number of positive integers 


n <x with w(n) > rloglog x is O (x(log x)'~"°8?). 


Find all positive integers & < 10 such that 44 +1 and 64 +1 are 
simultaneously prime. Let ny = 12k + 2. Prove that if 4k + 1 and 
6k + 1 are simultaneously prime, then d(n,) = d(nz + 1). 


Remark. It is an unsolved problem to determine whether there are 
infinitely many integers n such that d(n) = d(n+ 1). 


Prove that 


for all positive integers @ and n. 


Let @> 1. Prove that 


Prove that 


A Theorem of Ramanujan 


In Theorem 7.3 we computed the mean value of the divisor function d(n). In 
this section we shall determine the mean value of the square of the divisor 
function. We begin with an alternative representation for d?(n). 


238 7. Divisor Functions 


Theorem 7.7 
Bn) = So n(6)ds (3). 


62\n 


Proof. Define the arithmetic function jz as follows: 


~\ _ f w(/n) if nis a square, 
HO) { 0 otherwise. 


By Exercise 1, the function fj is multiplicative. Since the Dirichlet convolu- 
tion of multiplicative functions is multiplicative (Exercise 3 in Section 6.4), 
the function jz * d4 is multiplicative, and 


jixda(n) = (dds (7) 
d|n 
d ul8)\ds (S)- 


62\n 


We shall prove that ji * d4(p*) = (a+ 1)? for every prime power p*. By 


Theorem 7.5, 
4 at+3 
ato") = ("3"). 


jis dulp) =X w(d\as (2) = auln) = (3) =4 


5? |p 


and so 


If a > 2, then 
2 i pe 
juxda(p’) = S> p(d)da (= 
62| pe 
= ds (p*) — dy (p*~”) 
= a+3 if at+1 
- 3 | 
= (a+1)?. 
Since d(p*) = a +1, it follows that 
d*(p") = (a +1)? = ju d4(p*) 
for all prime powers p*. The functions d? and ji*d4 are both multiplicative. 


Since multiplicative functions are completely determined by their values on 
prime powers (Exercise 2 in Section 6.4), it follows that 


d?(n) = ji * da(n) 


for all positive integers n. 


7.2 A Theorem of Ramanujan 


Theorem 7.8 (Ramanujan) 


as LT >}. 


S- a(n) ~ =e (log a)? 


n<ux 


Proof. Applying Theorem 7.6 with ¢ = 4, we obtain 


tlog? x 


D(a) = + O(log? xr). 


By Theorem 7.7 we have 


S> @(n) 


n<ux 


Yul) (J) 


n<a 62|n 

So u(5)da(k) 
O2k<ax 

>> u(6) D5 da(k) 
6< JE k<a/62 

d H(6)Ds (5) 
5<Vz 


(6) log? v 


x u() ) 3 1 2 0 
2 log oD +O)}2 SS 52 108 Z 
5< Vz b< VE 
We estimate these sums separately. The first term is 
3 
( Ven uss ld) HO) o98- ‘zlog' 6° 
=o \ b< VE 
L(6 ee 6 
wd) +O | «log? a S- 
dS Va d<Va 


1 log® 6 
2 O(a) ans? x log? x De = 


6< VE 


+ +0 (a log? os 
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by Theorem 6.17. Similarly, 


1 Ny 1 
x ye 7a oe” 2 < rlog? x S- a < rlog? x. 
b6< VE b< Va 


This completes the proof of Ramanujan’s theorem. 


Exercise 


1. Prove that the function ji is multiplicative. 


7.3. Sums of Divisors 


The arithmetic function o(n) is defined as the sum of the positive divisors 
of n. Thus, 


o(1) = 1 = 1, o(6) = 1424346 = 12, 
o(2) = 142 = 3, o(7) = 147 = 8, 
o(3) = 143 = 4, o(8) = 1424448 = 15, 
o(4) = 14244 = 7, o(9) = 14349 = 13, 
o(5) = 145 = 6, o(10) = 14245410 = 18. 


If n > 2, then o(n) > n+ 1. We can use the standard factorization of n 
to compute a(n). We begin with an example. Consider 180 = 27375. Every 
divisor d of 180 is of the form d = 2%3°5°, where 0 <a < 2,0 <b < 2, and 
0<c< 1. We have 


o(180) = Sod 
d|180 
= 1424344454+64+9+410412 
+15 + 18+ 20+ 30 +36 + 45 + 60 + 90 + 180 


= (1424+4)(14+34+9)(1+5) 


= 546. 
We can compute o(n) in this way for any positive integer n. If d divides n, 
then 
d= |[p”, 
pin 
where 


0 < ap < v(n), 
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and 


a(n) = Sod 


d|n 
vp(n) 
= Tee 
p|n 4p=0 


Vp(n)+1 _ 1 


Ap 


Pp 
p—1 
pin 
This formula expresses o(n) in terms of the standard factorization of n. 


Theorem 7.9 The arithmetic function o(n) is multiplicative. 


Proof. Let m and n be relatively prime positive integers. Since no prime 
divides both m and n, we have 


p-1l 
p|mn 
2 Il pee(m)+1 | Il pee(n+i —1 
7 p-l p-1 
p|m pin 
= a(m)a(n) 


This completes the proof. 


The ancient Greeks divided the positive integers into three classes, de- 
termined by the sum of the divisors of the integer. They called a number 
perfect if a(n) = 2n. A number is called abundant if o(n) > 2n. A number 
is called deficient if a(n) < 2n. The smallest perfect numbers are 


6 = 2-3 = 21(22~-1), 
28 = 4-7 = 22(23-1), 
496 = 16-31 = 24(25-1), 
8128 = 64-127 = 26(27-1). 


Theorem 7.10 (Euler) An even integer n is perfect if and only if there 
exist prime numbers p and q such that 


q=2?-1 


and 
n= 2P-1q. 
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Proof. If n is of this form, then q is odd and 2n = 2?q. It follows that 


o(2?~")o(q) 
(2? — 1)(q+1) 
2Pq + (2? —q-1) 
2n 


I 


o(n) 


l 


9 


and so n is perfect. 
Conversely, if n is an even perfect number, then o(n) = 2n. Writing n in 
the form 
where m is odd and k > 2 (since n is even), we have 
2*m = 2n = a(n) = 0(2*-1m) = 9 (2°77) o(m) = (2* — 1)o(m). 


Since 2" —1 divides 2*m and 2" —1 is relatively prime to 2", Euclid’s lemma 
implies that 2* — 1 divides m, and so 


m= (2*-1)é 
for some odd integer ¢. Then 
2* (2* —1)¢= (2° -1) 0 ((2°-1) 2). 
If > 1, then 1,2, and (2* — 1)@ are distinct divisors of (2* — 1), and 
2ke=o ((2*-1) 2) >14+ 24 (2° -1)e=2%e4+1, 
which is impossible. Therefore, 2 = 1 and 


2* = o(2*-1)=14+(2*-1)+ So a, 
a|(2k 1) 
1<d<2k-1 
it follows that 2*—1 has no proper divisors, that is, 2*—1 is a prime number. 
If the exponent k were composite, then k = kiko with 1 < ky < ko < k, 
and 


at —1 = (2h) —1 = (2h 1) (14.28 4:2 4... poke) 


would be composite, which is false. Therefore, k = p is also prime, and 
m = q = 2? — 1. This completes the proof. 


A prime number of the form 2? — 1 is called a Mersenne prime. (Ex- 
ercise 5 in Section 1.5 and Exercise 9 in Section 3.4 are about Mersenne 
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primes.) By Theorem 7.10, every even perfect number is uniquely associ- 
ated with a Mersenne prime. Only finitely many Mersenne primes have 
been discovered, so we know only finitely many even perfect numbers. A 
list of all Mersenne primes known in October, 1999, appears in the Notes 
at the end of Chapter 1. 

It is an unsolved problem to decide whether there exist infinitely many 
even perfect numbers. We know almost nothing about odd perfect numbers, 
and it is an unsolved problem to decide whether even one odd perfect 
number exists. 

Let 


o*(n) =o(n) -—n= S- d. 
den 

We define o*(0) = 0. A pair (m,n) of positive integers is called an amicable 
pair if 

a*(n) =m 
and 

o*(m) =n. 
Equivalently, (m,n) is an amicable pair if o(m) = o(n) = m+n. For 
example, the pair (220,284) is amicable, since 


o* (220) = 284 


and 
o* (284) = 220. 


It is not known whether there exist infinitely many amicable pairs. 
For every positive integer n and nonnegative integer k, there is an integer 
S;,(n) obtained by iterating the function o* as follows: 


So(n) = n, 
Si(n) = o*(n), 
So(n) = o*(Si(n)) = o* (o*(n)), 


Srii(n) = o*(Sz(n)), 


for all positive integers k. The sequence {.5;(n)}?°., is called the aliquot 
sequence of n. Since there exist abundant, perfect, and deficient numbers, it 
can happen that S;41(n) > S,(n), Sp4i(m) = Sz(n), or Sp4i(nr) < Se(n), 
and so the aliquot sequence can oscillate up and down. Computations indi- 
cate, however, that for small n the aliquot sequence always becomes even- 
tually periodic. For example, the aliquot sequence for 12 is 


12,16, 15,9,4,3,1,0,0,.... 
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If n is a perfect number, then 5;,(n) = n for all k, and the sequence 
{Sp (7)}?2p is constant. If (m,n) is an amicable pair of integers, then 
So(n) = n, 
Si(n) = m, 
So(n) = n, 
S3(n) = m, 


and so on. Thus, the aliquot sequence for an integer in an amicable pair 
oscillates with period 2. It is an unsolved problem to determine if, for every 
positive integer n, the sequence {S;(n)}?2p9 is eventually periodic. This is 
called the Catalan—Dickson problem. 

There is a natural generalization of the “sum of the divisors” function. 
For any real or complex number a, we can define the arithmetic function 


To(n) = a d™. 
dl 
d>1 


Then oo(n) is the divisor function d(n), and o1(n) = o(n). The function 
0 (n) is multiplicative for every number a (Exercise 8). 


Exercises 
1. Compute o(n) for 11 <n < 20. 
2. Prove that (17296, 18416) is an amicable pair. 
Hint: 17296 = 2* x 23 x 47 and 18416 = 24 x 1151. 
3. Prove that (9,363, 584, 9,437,056) is an amicable pair. 
Hint: 9,363,584 = 2” x 191 x 383 and 9,437,056 = 27 x 73727. 


4. Let A be a set of positive integers, and let A(x) denote the number 
of elements a € A such that a < x. The set A has asymptotic density 
a if limg.. A(x)/x = a. Prove that the set of even perfect numbers 
has asymptotic density zero. 


5. Prove that o(n) = no_1(n) for every positive integer n. 


6. Prove that ae 
0< 2 = < a_1(n) logn. 


7. Prove that for every number a, 


] yy 
De a c= o(a—1(n) log® n) . 
d\n 


10. 


11. 


12. 


13. 


14. 
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Hint: Observe that for any <¢ > 0, 


log* d elog* n log® d ~ , a(n) log* n 
Sy Bed gy Se ye Be oo (nplogt ny ODE 
als dene dove 


and apply Theorem 7.2. 


. Prove that the function o,(n) is multiplicative for every real or com- 


plex number a. 


. Let a > 1. Prove that 


n® < a(n) < C(a)n® 


for all positive integers n. 
Hint: Dadle d* = Van(n/d)®. 
Let a > 1. Prove that 


for every integer n > 2. 


Prove that 
nae OO, 
noo n 
Let x > 2 andn=[J,<, p. Prove that 
a(n) 1 
eae 3 = 
pXu 


Remark. Theorem 8.7 implies that limsup,_,,, 7(n)/n = oo. 


Consider the numbers 


a@ = 12,496 = 24x11x 71 
a, = 14,288 = 24x19x47 
@, = 15,472 = 24x 967 
aj = 14,536 = 23x 23x79 
ag = 14,264 = 23 x 1783. 


Prove that if r € {0,1,2,3,4} and k=r (mod 5), then 
S;,(12, 496) = a,, 
and so the aliquot sequence for 12,496 is periodic with period 5, 


Compute the aliquot sequences {S;,(n)}?29 for n = 28, 29, 30, 31, 32. 
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7.4 Sums and Differences of Products 


In this section we prove two theorems of Ingham about sums and differences 
of divisor functions. These results have beautiful interpretations in terms 
of the number of solutions of diophantine equations in positive integers. 

Let V(n) denote the number of representations of n as a sum of products 
of two positive integers. The function V(n) counts the number of solutions 
in positive integers of the diophantine equation 


n= ab+ cd. (7.1) 


Let cd = k. Then 1 < k < n—1 and n—k = ab. Since the number of 
solutions of k = cd is d(k) and the number of solutions of n — k = ab is 
d(n — k), it follows that the number of solutions of (7.1) with cd = k is 
d(k)d(n — k), and so 


Consider the diophantine equation 
£=ab-—cd. (7.2) 


For every positive integer k, the number of solutions of (7.2) with cd = k 
and ab = k+ @ is d(k)d(k + £). Let Ug(n) denote the number of solutions 
of (7.2) in positive integers with cd = k < n. Then 


Uz(n) = o d(k)d(k + 0). 
k=1 


We need the following lemma. 


Lemma 7.1 For every x > 1, 


Proof. We define F 
1; a ae (7.3) 


and 


7.4 Sums and Differences of Products 
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If st < 2 and r is a common divisor of s and t, then r? < st < x, and so 


r</x and 


f(x) 


g( 


l 


by Theorem 6.17. Since 


and 


D 


gS = 


nS@  r<(«/n)*/? 


ee 


ler) 


d(n) log? a 
2 


2 


n<ux 


by Exercises 13 and 14 of Section 7.1, it follows that 


f 


6 d(n) , 1 d(n) 
m2 n- G gi/2 d. mi/2 


+ O(log x) 


d 
iy) = 2n1/? log x + O(a'/) 
nt/ 


(x) = 2. log? a + O (log x). 
7 
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This completes the proof. 


Theorem 7.11 


n—1 


V(n) = So d(k)d(n = k) ~ So(n) log? n. 


k=1 


Proof. The arithmetic function V(n) is the number of solutions of the 
equation n = ab+ cd in positive integers. If (a,b, c,d) is a solution of this 
equation, then 


(ab+cd)* (ab— cd)? - nr 


ac: bd= Z m rie 


and so ac < n/2 or bd < n/2. Let P denote the number of solutions with 
ac < n/2, let Q denote the number of solutions with bd < n/2, and let R 
denote the number of solutions with both ac < n/2 and bd < n/2. Since 
(a, b, c,d) is a solution if and only if (b,a,d,c) is a solution, it follows that 
P=Q and 

V(n) =P+Q-R=2P-R. 


We first compute P. For fixed positive integers a and c, let ®(a,c,n) 
denote the number of solutions of the equation ab + cd = n in positive 
integers b and d. Then 


P= oS ®(a,c,n). 


ac<n/2 


Let r = (a,c) denote the greatest common divisor of a and c. If r does not 
divide n, then ®(a,c,n) = 0. Therefore, we can assume that r divides n, and 
there exist positive integers a,y, and 7 such that a= ra,c=ry,n=rn, 
and (a,7) = 1. Moreover, ®(a,c,n) = ®(a,¥, 7). 

Since (a, y) = 1, there exist integers bp and dp such that abo + ydo = n, 
and every solution of the equation ab + cd = n is of the form b = bp + yh 
and d = dy — ah for some integer h. It follows that every solution of the 
equation ab + cd = n is of the form b = bb) + yh and d = dy — ah for some 
integer h. If b > 0 and d > 0, then 
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where |¥| < 1 (Exercise 2). We have 


P= SS ®(a,c,n) 


= ye S- ®(a, 7,7) 


(a,y)=1 


rln ay<n/2r2 
(a,y)=1 


1 1 
rln eo: y ac<n/2 
3 
= © + (5 (ve Fs) +0 (lor s"3) ) +0 a) 
rin k<n/2 


2 
a. en : (log ie ) O (no_1(n) log n) + O (nlogn) 


5 ie ome 


3 
= —yno—1(n) log? n +0 (no_1(n) log? n) + O(a(n) logn) 


= a(n) log?n +0 (a(n) log? n), 
by Lemma 7.1, Theorem 7.3, and Exercises 5 and 7 in Section 7.3. 

Next we compute R. For fixed integers a and c, the linear diophantine 
equation ab + cd = n is solvable in integers if and only if n is divisible by 
r = (a,c). Again we write a = ra,c = ry, and n = rn, where (a,7) = 1. If 
the integers bp) and do solve the equation ab + cd = n, then every solution 
is of the form 

b= bo + hy 


and 
d=dy—ha 


for some integer h. 
Let a and c be positive integers with ac < n/2. Let U(a,c,n) denote the 
number of solutions of the equation ab + cd = n in positive integers b and 

d with 
bd < 


Nl s 
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Then W(a,c,n) = V(a,7,7) counts the number of integers h such that 


bo + hy > 0 and dy — ha > 0, (7.4) 
and s 
(by + hy)(do — ha) < %. (7.5) 
We define the rational number 
a(bo + yh) 
u = ————_.. 
n 
Then 
1 ae c(do = ah) 
n 
Inequalities (7.4) imply that 
O<u<l. 
Inequality (7.5) implies that 
1 
u(l—u) < a <-. 
nA 
Solving this quadratic inequality, we obtain 
= 
O<u< > (7.6) 
and ; 
= <u<l, (7.7) 
where 
2 
peta 
n 


Note that 0 < v <1, since 0 < ac < n/2. Inequality (7.6) is equivalent to 


by ye (l=e)nr bo 
oy 2ac Y 


9 


and inequality (7.7) implies 


1 
0<l-u< 
< oa 


which is equivalent to 
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Both of these intervals have length 


(1 —v)nr z (1 — v?)nr ¥ (1 — v2)nr = 


2ac (1+ v)2ac — 2ac 


It follows that if a and c are positive integers with (a,c) < n/2 and (a,c) = 
r, then 


U(a,c,n) = ie ls + O(1) < 2r+O(1). 


ac 
Therefore, 
R= S- W(a,c,n) 
ac<n/2 
= S- Sou (a,c, n) 
rln ac<n/2 
(a,c)=r 
< SS) SO @r+0(1)) 
rin ay<n/(2r?) 
(a,y)=1 
= 2y)r YO 14+ S° OW) 
rin ay<n/(2r?) ac<n/2 
«K« Sor SO dk)+ YO dk) 
rin k<n/(2r?) k<n/2 
nlogn 
l 
< or ( = ) + ntogn 
< no_1(n)logn 
= o(n)logn. 
We have 
V(n) = 2P-R 
6 ? 
= ~20(n) log? n + o(a(n) log? n) + O(a(n) log n) 
6 ‘ 
~ =20(n) og” n. 
This completes the proof. 


Theorem 7.12 For every positive integer @, 


= So d(k)d(k + 8) ~ So 1(£)nlog? n. 


k=1 
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Proof. Let x be the geometric mean of n and n+ @, that is, 


a= V/n(nt+0=n+8, 


where 2 
0<0< 3° 


We have x = O(n). 
The function Up(n) counts the number of 4-tuples (a, b,c, d) of positive 
integers such that 


ab—cd=¢ and cd <n. (7.8) 
If (a,b, c, d) satisfies (7.8), then 
ac-bd<n(n+£) = 2", 


and so ac < x or bd < x. Let P be the number of solutions of (7.8) with 
ac < x, Q the number of solutions of (7.8) with bd < x, and R the number 
of solutions of (7.8) with both ac < x and bd < «x. The symmetry of 
equation (7.8) implies that P = Q, and so 


Up(n) = P+Q-—R=2P-R. 


We shall find asymptotic formulae for P and R by the same method used 
in the proof of Theorem 7.11. 

We first compute P. For fixed positive integers a and c, let ®¢(a,c,n) 
denote the number of solutions of the equation ab — cd = ¢ in positive 
integers b and d with cd < n. Let r = (a,c) denote the greatest common 
divisor of a and c. The integer r must divide @, and so there exist positive 
integers a,y, and such that a = ra,c = ry,é = rd, and (a,y) = 1. If 
cd <n, then yd < n/r. If ab — cd = @, then ab — yd = X. If ac < a, then 
ay < «/r?. Therefore, @p(a,c,n) = ®)(a,y,n/r) and 


P= SS ®,(a, Cy n) = S- S- ®y(a,7,n/r). 


acca rll aysa/r2 
(a,y)=1 


Since (a, y) = 1, there exist integers bp and dp such that abo — ydo = A, 
and every solution of the equation ab — yd = 4 is of the form b = bo + yh 
and d = dy + ah for some integer h. It follows that every solution of the 
equation ab — cd = £ is of the form b = bp + yh and d = do + ah for some 
integer h. If d > 0 and cd < n, then b > 0 and 

sp or Og Dn (7.9) 
a 


ac a ray a 


Conversely, if h satisfies (7.9), then b and d are positive integers with cd < n. 
Therefore, 


e(a,¢,n) = O(a, 7.0/1) = +8. 
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where |v| < 1. We have 


S- ys ®y(a, Ys n/T) 


rll ay<a/r? 
(a,y)=1 


-> > (e+) 


rll ay<a/r? 


P 


(a,y)=1 
rlé : eens acca 
= © (% (5) +0 (002) +0 (aw 
rlé k<a 
- 2 re (lox 5) O (no_1(n) log n) + O (a log x) 


a, ae * (log). O (a(n) log n) 


3 
= —20(n) log* n + 0 (a(n) log” n) , 


by Lemma 7.1, Theorem 7.3, and Exercises 5 and 7 in Section 7.3. 

Next we compute R, which is the number of solutions of (7.8) with both 
ac < x and bd < «x. For fixed positive integers a and c, we let V(a,c, 0) 
denote the number of ordered pairs (b,d) of positive integers such that 
ab — cd = ¢ and 


0<d< 


mand) = obd <a. 
Cc 


If r = (a,c), then a = ra and c = ry, where a and ¥ are relatively prime 
positive integers. If r does not divide @, then U(a,c, 2) = 0. If r does divide 
é, then @ = rr and W(a,c,0) = V(a,y,.). Since (a,y) = 1, there exist 
integers bp and dp such that 


abo ome! ydo = r, 


and every integral solution of the linear diophantine equation ab — yd = A 
is of the form 
b=bo+y7h and d=dy+ah 


for some integer h. Every solution in integers of ab — cd = £@ is of the form 


b=bo+yh and d=dy+ah 
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for some integer h. The inequality 0 < cd < n implies that 


Since 


bo do a abo — ydo - r 
y @ ay ay 


>0, 


it follows that * , 
0<—+h< ah: 


If bd = (bo + yh) (do + ah) < x, then 


2 
(2+n) < (t+) (S4n)c 
a ay a ay 


and so 
Ge Soe 
ay 
Therefore, 
WG oie sa 0 
ay 
and 
R= S- U(a,c, &) 
ac<cux 


T 
M 
iM 
= 
& 
Qo 
S 


< 2 
(a,y)=1 
cowie DD Gi 
re ae 
= 2a), ye 
re eee VO 
< a . 
rlé 
< «sloga, 


by Exercise 14 in Section 7.1. This completes the proof. 
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Exercises 


1. Prove that the diophantine equation (7.2) has infinitely many solu- 
tions in positive integers. 


2. Let x and y be real numbers with x < y. Prove that the number of 
integers in the open interval (x,y) is y— 2+, where |0| < 1. 


7.5 Sets of Multiples 


Let A be a nonempty set of positive integers. The set of multiples M(A) 
consists of all positive multiples of elements of A, that is, 


M(A)={ma:a€ Aand me N}. 


The set B is called a set of multiples if B = M(A) for some set A. For 
example, if A = {2}, then M(A) is the set of positive even integers. If P is 
the set of prime numbers, then M(P) is the set of all integers n > 1. 

A nonempty set A of positive integers is called primitive if no element of 
A divides another element of A, that is, if a,a’ € A and a divides a’, then 
a = a’. If A; and Ag are nonempty sets of positive integers and A, is a 
subset of Ag, then M(A;) is a subset of M(Ag2). If Ag is primitive and A, 
is a proper subset of Az, then, by Exercise 4, M(Aj,) is a proper subset of 
M(Ao). 

We shall prove that if B is a set of multiples, then there exists a unique 
primitive set A* such that B = M(A*). 


Lemma 7.2 Let A be a nonempty set of positive integers, and let A* be 
the subset of A consisting of all integers a € A not divisible by any other 
element of A. Then A* is a primitive set, and 


M(A) = M(A*). 


Proof. The primitivity of the set A* follows immediately from the defi- 
nition. 

If b € M(A), then b is a multiple of a for some integer a € A. If a Z A*, 
then a has a proper divisor that belongs to A. Let a’ be the smallest element 
of A that divides a. Then a’ € A*, and 6 is a multiple of a’. This completes 
the proof. 


Lemma 7.3 If A, and Ag are nonempty sets of positive integers such that 
M(Aj) —_ M (Ag), then M(Ai N Ap) = M(Aj). 
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Proof. By Exercise 4, M(A1M Ag) is a subset of M/(A1). If M(A1N Ag) is 
a proper subset of M(A;), then there exists a smallest integer b € M(A;) \ 
M(Ai NM Ag). Since b € M(Aj1) = M (Ag), we have 
b= ™m 1a, = Mga 
for positive integers m1, M2, @1,@2 with a, € Aj, ag € Ag. Moreover, a; # 
az since b ¢ M(A,M Ag). Suppose ay < ag. Since a; € M(Aj) and 


ay < a2 <™M2a2 = b, 


the minimality of b implies that a € M(A,/M Ag). Then a, = ma for some 
a € Ay Ag, and so b = mya, = myma € M(A,N Ag), which is absurd. It 
follows that M(A,) = M(Ai/N Ag). 


Theorem 7.13 Let B be a set of multiples. There exists a unique primitive 
set A* such that B= M(A*). 


Proof. Let B = M(A) for some set A, and let A* be the primitive subset 
of A constructed in Lemma 7.2. Then B = M(A*). Let A’ be any set of 
positive integers such that B = M(A’). By Lemma 7.3, 


B= M(A’)) = M(A'N A*) = M(A*). 


Since A’N A* is a subset of A*, it follows from Exercise 4 that A’N.A* = A*. 
Thus, A* is a subset of every set A’ such that M(A’) = B, and so A* is 
the primitive set uniquely defined by 


A= [) A. 


A'CN 
M(A’‘)=B 


This completes the proof. 


Let A be a set of integers. The counting function A(x) of the set A counts 
the number of positive elements of A not exceeding x, that is, 


A(z) = SO 1. 


acA 
l<a<a 


The lower asymptotic density of A is 


Fe er ee 


D000 x 


The upper asymptotic density of A is 


dy (A) = lim sup a) 
x 


zrz—0o 
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The set A has asymptotic density d(A) = a if dz(A) = dy(A) = a, or, 
equivalently, 


asin OO, 


mZ—co 6 


The set of multiples of a finite set of positive integers always has an asymp- 
totic density (Exercise 6), but it is possible to construct an infinite set A 
such that 1/(A) does not have an asymptotic density. The following result 
gives a sufficient condition for the set of multiples of an infinite set to have 
asymptotic density. 


Theorem 7.14 If A is an infinite set of positive integers such that 
1 
yl ex 
acA . 
then the set of multiples of A has an asymptotic density. 


Proof. Let A = {a;}%2,, where a, < ag <-:-, and let B = M(A). For 
every positive integer k, let B; denote the set of all positive integers that 
are divisible by ax, but not divisible by a; for all i < k. The sets By are 
pairwise disjoint, and B = UZ? Bx. It follows that 


and 


k=1 
for all « > 1. There are [x/a,] positive integers not exceeding x that are 
divisible by az, and so 


0 < B,(2) < [=| oe 


ak 
Equivalently, 
xv Qk 
for all > 0. Let e > 0, and choose Ky = Ky(e) such that 


O< 


So —<e 
k=K, 41" 
Then 
ky foe) foe) 
0< B(x) yo) _ S- By() 2 S- Eames 
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By Exercise 8, the set B;, has an asymptotic density, that is, there exists a 
number @; > 0 such that 


d(B,) = lim Bley. g. 


D000 x 


Moreover, 3; = d(B,) = 1/a; > 0. For every positive integer @, the density 
of the set of integers divisible by at least one of the integers a,,...,a¢ is 
Gi +--++ Be, and so 


L 
O0<S°& <1. 
k=1 


Therefore, the infinite series )77°., Gg converges to some number 3 > 0. 
We shall prove that the set of multiples 1/(A) has density (, that is, 


ie y= 6. 


Zo 


For every € > 0 there exists an integer Ky = Ko(e) such that 


By(2) E 
| is Br| < K 
for all x > a and k=1,...,K. Then 
Bla By (a 
x a 
K K 
By(2) 
< a x —- >> Be + 2e 
k=1 k=1 
K 
By(a) 
< — 2 
So, r Br| + 2€ 


This completes the proof. 


The following result will be used in Section 7.6 to prove that the set of 
abundant numbers has an asymptotic density. 
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Theorem 7.15 If A is an infinite set of integers with counting function 


= (<2,) 


for x > 2, then the set of multiples M(A) has an asymptotic density. 


Proof. By Theorem 6.10, the infinite series }7, <4 a~' converges. It fol- 
lows from Theorem 7.14 that the set of multiples 1/(A) has an asymptotic 
density. 


Exercises 
1. Prove that if 1 € A, then M(A) =N. 


2. For every positive integer n, prove that the set {n+1,n+4+2,...,2n} 
is primitive. 


3. Let Q(n) denote the total number of prime factors of n. For every 
r > 1, prove that the set {n >1:Q(n) =r} is primitive. 


4. Prove that if A; and Ag are nonempty sets of positive integers and 
A; C Ag, then M(A1) C M(Az2). Prove that if Ag is primitive and A; 
is a proper subset of Ag, then M(Aj) is a proper subset of M(Ag2). 


5. Prove that if A is a primitive set, then A has upper asymptotic density 
dy(A) < 1/2. 
Hint: Let A = {a;}92,, where a, < ag < a3 < --+. Prove that each 
a; can be written uniquely in the form a; = 2“‘v;, where u; > 0 and 
v; is an odd positive integer. Prove that the numbers v; are distinct, 
since the set A is primitive. 


6. Let « > 1. Let A= {a1,..., ax} consist of k distinct positive integers. 
For every subset A’ = {aj,,...,a;,} C A, let N(a, A’) denote the 
number of integers up to x divisible by every element of A’. Prove 


that 
N(a, A’) = | ——. 
ad mcm ; 
where Iem(A) = [a;,,...,a@;,] is the least common multiple of the 


integers in A’. Prove that the number of integers up to «x that are 
divisible by no element of A is 


j=0 A'CA j=0 A'CA 
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Let B = M(A) and let B(x) be the counting function of B. Prove 


that 
. ; x 
B(a) = ae pS Fad 
: | |AN=3 
= 2) (-197 SO acy eo 
AS Ae 


Deduce that the set of multiples 1/(A) has asymptotic density 


k 
d(M(A)) = S(-1)? SO ETO 


j=1 Cc 
|A SJ 


7. Let A = {a1,...,a%} consist of k pairwise relatively prime positive 
integers. Prove that 


8. Let A = {a,,...,a,} consist of k distinct positive integers, and let 
B,, be the set of positive integers divisible by a, but not divisible 
by a, for all i < k. Prove that the set By, has an asymptotic density 
d( By), and compute d(B,). 


7.6 Abundant Numbers 


In this section we consider the set of perfect and abundant numbers. For 
simplicity, we modify our previous terminology and call the elements of 
this set abundant. Now a positive integer n is abundant if o(n) > 2n. By 
Exercise 2, if n is abundant, then every multiple of n is also abundant. 

An integer n is called a primitive abundant number if n is abundant but 
no proper divisor of n is abundant, that is, a(n) > 2n but o(d) < 2d for 
every proper divisor d of n. The set of abundant numbers consists of all 
multiples of the primitive abundant numbers (Exercise 3). We shall prove 
that the set of abundant numbers possesses an asymptotic density. 

An integer n will be called a k-abundant number if o(n) > kn. Let Ax 
be the set of all k-abundant numbers. 

A primitive k-abundant number is a positive integer n such that o(n) > 
kn, but o(d) < kd for every proper divisor d of n. Let PA, denote the set of 
primitive k-abundant numbers. Then A; = M(PA,), that is, A, is the set of 
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multiples of PA;. We shall prove that the set A, has an asymptotic density 
for every integer k > 2. By Theorem 7.15, Ax will have an asymptotic 
density if the counting function of the set PA; of primitive k-abundant 
numbers is O(x log”? 2). 

We begin with some lemmas about prime divisors. The first result states 
that it is rare for an integer to be divisible by a large prime power. 


Lemma 7.4 The number of positive integers n < x divisible by some prime 
power p" > log* «x with r > 2 is O(xlog~? 2). 


Proof. If p is a prime such that p > log? x and p? divides n, then n is 
divisible by a prime power p” > log* x with r > 2. The number of such 
integers n < a is [a/p?]. 

If p < log’ a, let Up be the least integer such that p“? > log’ «. The 
number of integers n < x divisible by a prime power p” > log’ z is [x /p™?]. 

Let N(x) denote the number of integers n < ax divisible by a prime 
power p” > log* xz. Then 


no < EEE la 


IA IA 
8 8 
=| a 3; | = 
+ + 
WO On OS 
a 8 ‘n| 8 
a “——" 

a 

08 

ne a4 
ar 


< — 


This completes the proof. 


The next result states that it is rare for a number to have many distinct 
prime divisors or to have only small prime divisors. Let w(n) denote the 
number of distinct primes that divide n. Let P(n) denote the greatest prime 
divisor of n. 


Lemma 7.5 Let x > e© and y = loglogx. The number of positive integers 
n<« such that either w(n) > 5y or P(n) < x!/) is O(xlog~? x) for all 
sufficiently large x. 


Proof. Let No(x) denote the number of positive integers n < a with 
w(n) > 5y. By Exercise 9 in Section 7.1, 


x x 
(log y)blog2-1 = log? 7 : 


No(a) < 
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Let p be a prime. If p” < x, then 0 < r < loga/logp < log x/ log 2, and so 
the number of prime powers p” < « with p < x!/(6Y) does not exceed 


(1 ie ves) 2 6v) < GU Jog x. 
log 2 


Let N3(a) denote the number of integers n < x such that w(n) < 5y and 
P(n) < a/(), Then 


5y 
N3(") < (2ifev) log x) K 3 
log* x 


for all sufficiently large x. 


Combining Lemma 7.4 and Lemma 7.5, we obtain the following result. 


Lemma 7.6 There are only O(xlog~? x) integers n < x that fail to satisfy 
all of the following three conditions: 


(i) If p” divides n and r > 2, then p” <log* z. 

(ti) w(n) < By. 

(iti) P(n) > a/6), 
Lemma 7.7 Let n < « be a primitive k-abundant number satisfying con- 
ditions (i), (ti), and (iii) of Lemma 7.6. Then n is divisible by a prime p 


such, that 
logt x < p< l/(8y), (7.10) 


Proof. If not, then we can write n = ab, where a is a product of primes 
less than log* x, and 6 is a product of primes greater than x!/(3”). Since 
al/(139) < a1/(6¥), condition (iii) implies that b > 1. 

By condition (ii), w(b) < w(n) < 5y. Then 


sit) < H(1+2+5+-) 


P Pp 


IA 
—N 

ee 
bb 
NV 


2 w(b) 
= (1 ns aS} 


oe NPP 
: (1+ saan) 


20y 
xi/(13y) 


7.6 Abundant Numbers 263 


if x is sufficiently large (by Exercise 4 with c = 2). Every prime that 
divides a is less than log’ x, and, by condition (i), every prime power that 
divides n, and hence a, is also less than log* x. Since w(a) < w(n) < 5y by 
condition (ii), it follows that 


1<a< (log*x)*¥ = (log x). 


By condition (iii), b > 1, and so a < n. Since a is a proper divisor of the 
primitive k-abundant number n, we have 


a(a) < ka. 
Since *& is an integer, we have 
a(a) <ka—1, 
and so ais i ‘ 
a Sk . F (log x)20y" 


Since o(n) is multiplicative and n = ab with (a,b) = 1, we have, for x 
sufficiently large, 


a(n) _ a(a) o(b) 
n - a b 
1 20y 
= (: = (og) (: ? ant 
pa 2OkY 1 


"gl /(13y) (log x)20y 
<_ek, 


which is impossible, since the integer n is k-abundant. Therefore, n must 
be divisible by a prime p in the interval (7.10). 


Lemma 7.8 If x is sufficiently large and n < x is a primitive k-abundant 
number satisfying conditions (i), (ti), and (ti) of Lemma 7.6, then 
a(n) k 

n gil (6y) : 


Proof. By condition (iii), the integer n is divisible by a prime p such 
that 
p> P(n) > ov), 
Since p? > «!/@¥) > log’ x for x sufficiently large, condition (i) implies that 
p* does not divide n. Therefore n = mp, where (m,p) = 1 and o(m) < km 
since n is primitive k-abundant. It follows that 


1 k 
BE sce al) <k(1+2) <k+ sas. 
n m p p x /(6y) 
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This completes the proof. 


Theorem 7.16 For every integer k > 2, let PAy(a) denote the number of 
primitive k-abundant numbers not exceeding x. Then 


PA;,(2) < — 
log* x 


and the set Ay of k-abundant numbers possesses an asymptotic density 
Proof. By Lemma 7.6 there are only O(2 log~* x) primitive k-abundant 
integers that fail to satisfy conditions (i), (ii), and (iii) of Lemma 7.6. 
Let t be the number of primitive k-abundant integers n < x that do 
satisfy these three conditions. We denote these numbers by nj,...,n1. By 


Lemma 7.7, corresponding to each integer n; there is a prime p,; such that 
p; exactly divides n; and 


log* x <p < gil (sy) 


Let nj; = pym;. Then (p;,m;) = 1 and 


a 
1 < MM; < a 
log” x 
It suffices to prove that the integers m, are distinct. 
Suppose that m; =m, for some i 4 j. Then p; 4 p;. Since 


a(n) — (pie +1) a(m) 


uz Pi M4 


and 
a(nj) _ (py +1) o(mi) 
Nh; Pj MN; , 


it follows that 

a(ni)n; _ (pit 1p; 

nio(nj) — pi(pj +1) 
Since p; and p, are distinct primes, it follows that (p; + 1)p; 4 pi(p; + 1). 
We can assume that (p; + 1)p; > pi(p; +1), and so 


a(ni)n; = (pi +1); 
nio(n;) pi(p; + 1) 
aes = te 
pi(p; + 1) 
1 
2 1+ a7G3a)(gt/ a) 4 1) 
2 il : 


" 992/(13y) ° 
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By Lemma 7.8, 


o(nin; (,, k& \12,, 1 
njo(n;) > (: gi/(6y) ) k <14 gi/(6y) ° 
This is a contradiction, since 


22/139) & gl/(6y) 


for all sufficiently large x. It follows that the numbers mj,,...,m, are dis- 
tinct, and so t < alog~* x. This completes the proof. 


Exercises 


1. Prove that 120 is a 3-abundant number. 
2. Prove that o(rn) > ro(n) for every r > 2. 


3. Prove that every abundant number is a multiple of a primitive abun- 
dant number. Prove that every k-abundant number is a multiple of a 
primitive k-abundant number. 


4. Prove that for every c > 1 there exists a number 69(c) > 0 such that 
for all u > 0 and v > 0 with wv < do(c), 


(1+u)® <1+cuv. 


7.7 Notes 


Ramanujan stated Theorem 7.8 in [121]. Wilson [157] published a proof 
of this result. Ingham [69] proved Theorems 7.11 and 7.12. Johnson [75] 
generalized Theorem 7.11 to sums of any finite number of products. He 
proved that for any integer s > 2, the number of solutions in positive 
integers of the diophantine equation 


N= LY H+ + LeYs 
is asymptotic to 
ds—1(n) log® n 
(s—1)!¢(s) ° 
where ¢(s) is the Riemann zeta function. 


Besicovitch [10] constructed the first example of a set of multiples that 
does not have asymptotic density. 
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Theorem 7.16 on the asymptotic density of the abundant numbers was 
proved independently by Chowla [16], Erdés [31], and Davenport [19]. The 
proof in this book is due to Erdés. For refinements and generalizations of 
this result, see Elliott, Probabilistic Number Theory I [28, Theorem 5.6]. 

There are excellent research monographs on many of the topics in this 
chapter, for example, Halberstam and Roth, Sequences [48, Chapter 5], 
Hall, Sets of Multiples [49], and Hall and Tenenbaum, Divisors [50]. Dick- 
son [25, Vol. I, Chapter I] is a historical catalog of results on perfect, abun- 
dant, deficient, and amicable numbers. 


8 


Prime Numbers 


8.1 Chebyshev’s Theorems 


Let (x) denote the number of prime numbers not exceeding 2, that is, 


T(x“) = yl 


pXau 


is the counting function for the set of primes. Euclid proved that there are 
infinitely many primes, or, equivalently, 

lim (x) = oo. 

@w— Co 
A classical problem in number theory is to understand the distribution of 
prime numbers. This problem is still fundamentally unsolved, even though 
we know many beautiful results about the growth of (x) as x tends to 
infinity. In this chapter we shall show that the order of magnitude of 1(z) 
is z/log a. In Chapter 9 we shall prove the prime number theorem, which 
states that (x) is asymptotic to x/log x, that is, 


We introduce the Chebyshev functions 


Ha) = So logp = log [[ p 


pSu pSu 
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and 


B(x) = D> loge. 


pe<a 
For example, 
0(10) = log 2 + log3 + log5 + log7 


and 
wW(10) = 3log 2 + 2log3 + log 5 + log 7. 


The functions J(x) and 7(x) count the primes p < x and prime powers 
p® <a, respectively, with weights log p. Clearly, 


V(x) < ¥(2). 
If p* < «, then k < [log x/log p], and so 


v@) = >) lbgp=y_ | Yo 1 togp = > | ES] toe p 


pk<a psx \ pk<e« psx 
k>1 k>1 
< y log x = m(x) log x. 
psu 


Chebyshev proved that the functions J(x) and w(a) have order of magni- 
tude x and that a(a) has order of magnitude x/ log x. 

Before proving these theorems, we need two results about binomial co- 
efficients. The first lemma states that for fixed n, the sequence of binomial 
coefficients (7) is unimodal in the sense that it is increasing for k < n/2 and 
decreasing for k > n/2. In the second lemma we apply the binomial theo- 
pan to obtain upper and lower bounds for the middle binomial coefficient 
es. 


Lemma 8.1 Letn>1 and1<k<n. Then 
n n 5 ° n+l 

ce < (;) if and only ifk < ">, 

n n i . n+l 

ee) (;) if and only ifk >", 


Me — {”) ; eG = ntl 
oy — (;) if and only if n is odd and k = ">. 


V 


Proof. Consider the ratio 


r(k) = Ca H=e} _ (kK-1in-k+ 0)! _ n-k+) 


(71) oe ki(n — k)! k 


Then r(k) > 1 if and only if k < (n+ 1)/2, and r(k) < 1 if and only if 
k > (n+1)/2. 


8.1 Chebyshev’s Theorems 


Lemma 8.2 For all positive integers n, 


2n 
2 < 60 on. 
2n n 


Proof. By the binomial theorem, 


2n 


a2 — (141)"=5) > @ > ea 


k=0 
By Lemma 8.1, the middle binomial coefficient (ie 
coefficient in the expansion of (1 +1)?”. Therefore, 
2n 2n-1 
2n 2n 
OF = rl 1 
> ( k 2s ( k . 


k=0 


aa (") 


< 2n 64 : 
n 


This completes the proof. 


Theorem 8.1 For every positive integer n, 


[[ ><a". 


pan 
Equivalently, for every real number x > 1 
O(a) < wlog4. 


Proof. Let m > 1. We consider the binomial coefficients 


uw = (eer ‘) 7 ) 
= (2m + 1)2m(2m — 1)(2m — 2)--+(m +2) 
m! ; 


This is an integer, since M is a binomial coefficient. Moreover, 
2 1 2 1 
eae ety 
m m+1 
ss 2m +1 
k 
k=0 
g2m+1 


A 
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) is the largest binomial 


(8.1) 


(8.2) 
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and so 
M<4a4™. 


If p is a prime number such that m+ 2 < p< 2m-+1, then p divides the 
product 
(2m + 1)2m(2m — 1)(2m — 2)--- (m+ 2), 


but p does not divide m!. It follows that p divides M, and so 
Pp 
m+2<p<2m+1 
divides M. Therefore, 
p<M<a4™ (8.3) 
m+2<p<2m+1 


for all positive integers m. 

We shall prove inequality (8.1) by induction on n. This inequality holds 
for n = 1 and n = 2, since 1 < 4! and 2 < 4?, respectively. Let n > 3, and 
assume that (8.1) holds for all positive integers m <n. If n is even, then 


pan pgn-l 
If n is odd, then n = 2m +1 for some m > 1, and 

IIe=- Il> TT » 

p<n pxm4+l m4+2<p<2m+1 
By the induction hypothesis we have 

[Loe (8.4) 
pxmt+1 

It follows from (8.3) and (8.4) that 


I] »= II Dp II p< gmtlym = 42mt+l = 4". 


p<n p<m4+1l m+2<p<2m+1 
This proves (8.1). 
Inequality (8.2) follows from (8.1) as follows. If « > 1, then n = [a] > 1 
and 
V(x) = V(n) = log II p<nlog4 < xlog4. 


psn 
The proof that (8.2) implies (8.1) is similar. 


We can now prove Chebyshev’s theorem that the functions 0(x), v(2), 
and (a) log x all have order of magnitude «. 
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Theorem 8.2 (Chebyshev) There exist positive constants A and B such 
that 


Az < U(x) < W(x) < a(x) loga < B (8.5) 
for all x > 2. Moreover, 
int = eat UO iene MCS ne 
too zoo E00 x 
and 
lim sup ue) = lim sup ue) = lim sup ae < log 4. 


Proof. Theorem 8.1 gives the upper bound W(x) < xlog4, and so 


lim sup —— ng) < log 4. 
x 


zwL— Co 


We shall compute a lower bound for 7)(a). Let n be a positive integer, and 
consider the middle binomial coefficient N = Gare Applying Theorem 1.12, 
we write N as a product of prime powers as follows: 


N= ( = (n+1)(n+2) oe (2n) = II prr(2n))—2vp(n!) 


n n! nl2 
pS2n 


Up((2n)!) — 2vp(n!) = ok (|=2| a) |=]) : 


By Exercise 7, [2¢] — 2[t] = 0 or 1 for all real numbers ¢, it follows that 


where 


log 2n 
0 < vp((2n)}!) — 2up(nt) < | FE. 
By Lemma 8.2, 
ae og n] 
<N= Il pe —2up(n!) < II p [er F 
ps2n p<2n 
and so ako 
2n log 2 — log2n < S- nee log p = W(2n). 
pan log p 


Let x > 2 and n = [2/2]. Then 
2n<a<2n4+2 
and 


Wx) > Y(2n) > 2nlog2—log2n 
> (a — 2)log2 —logx = x log 2 — log x — 2 log 2. 
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Therefore, 


We obtain a lower bound for V(x) in terms of 7(x) log x as follows. 


then 


and so 


It follows that 


o(2) 


lim inf 
wr CoO 


0< 


v2) 


x 


6<1, 


> log 2. 


> So logp 
ul-$<p<a@ 

> SS (1 — 6) log a 
vi <p<a@ 


= (1-6) (x(x) - ae") log x 
> (1—6)n(x) log — x!~* log x, 


ee 


(1 — 6)1(x) log x 


ae Se eine OE 
av ax 


x 


COS 


This holds for all 6 > 0, and so 


Similarly, 


The inequality 


implies that 


and 


lim inf ute) > lim inf 
x 


BOO. 


O(x) 


Inequalities (8.6) and (8.8) give 


lim inf 
r—Cco 


ox) 


= limin 


Ve) 


x 


xr—0o 


B—+00. 


x w—00 


log x 


m(x) log x 
os 


(8.6) 


(8.7) 


8.1 Chebyshev’s Theorems 273 


Combining (8.7) and (8.9), we obtain 
ox) v(@) 


lim sup —— = lim sup —— = lim sup 
x 


~— 00 ~— 00 x ~—00 


m(a) log x Sieaa 
ge ; 


This completes the proof. 


Theorem 8.3 Let p, denote the nth prime number. There exist positive 
constants a and b such that 


anlogn < py < bnlogn 
for all n > 2. 


Proof. By Chebyshev’s inequality (8.5), there exist positive constants A 
and B such that 


Apn < 1(pn) log pn = nlog pn < Bpn. 
Let a= B~! > 0. Since p, > n, we have 
Pn = Bo nlog pn > anlogn. 


Similarly, 
Pn < A™*nlog pn. 


For n sufficiently large, 


log Dn log n + log log pn — log A 
log n + 2 log log py, 


logn + (1/2) log pn, 


IN IA IA 


and so 
log Pn < 2logn. 


Therefore, there exists an integer np > 2 such that 
Dn < A-'nlog pn < 2A7~'nlogn 


for all n > no. Since p,/nlogn is bounded for 2 < n < no, there exists a 
constant b such that p, < bnlogn for all n > 2. This completes the proof. 


There is a useful notation for describing the order of magnitude of func- 
tions. Let f be a complex-valued function with domain D, and let g be a 
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function on D such that g(x) > 0 for all  € D. The domain D can be a 
set of real numbers or of integers. We write 


f = O(g) 


or 
f<g 


if there exists a constant c such that 
| f(x)| < cg(x) for all x € D. 


For example, Chebyshev’s theorem states that 


If D C R and limsup D = ov, that is, if D contains arbitrarily large real 
numbers, then we write 


f =0(9) 
if 
f(z) 


Zep g(Z) 
It follows from Chebyshev’s theorem that 
m(x) = o(2). 


We also denote by O(g) (resp. o(g)) any function f such that f = O(g) 
(resp. f = o(g)). For example, e* = 1 + O(a) on every interval [1, xo], 
sina = O(2) for all x, and log x = o(«*) for every a > 0. 

We say that the function f is asymptotic to g, written 


frg, 


f(@) _ 


cen ae) 


The prime number theorem states that 1(x) ~ «/log a. Since lim; f(x) = 
a if and only if liminf;... f(z) = limsup,_,,, f(x) = a, Theorem 8.2 im- 
plies that the following asymptotic formulae are equivalent: 


ax 


log x 


& 

ATS, 

& 
2 
8 
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Exercises 


1. Compute the asymptotic density of the set of prime numbers. 


2. Compute the asymptotic density of the set of prime powers. 
Hint: Let I(x) denote the number of prime powers p* < a. Show that 


T(x) = (ya) + (II(a) — I Va)) « r(@). 


3. Compute the asymptotic density of the set of integers divisible by at 
least two distinct primes. 


4. Prove that 
p(x) = V(x) + O( V2). 


5. Prove that w(x) = log N, where N is the least common multiple of 
the positive integers not exceeding x. 


6. Prove that there exist positive real numbers a and ( such that 


n 


MOPS [2 <n, 
i=l 


7. Prove that [kt] —k[t] € {0,1,...,4—1} for all positive integers & and 
real numbers ¢. 


8. Prove that there exists a constant c such that, for all x sufficiently 
large, there exists a prime p such that « < p< (1+c)z. 


9. The prime number theorem states that J(x) ~ x. Prove that the 
prime number theorem implies that for every 6 > 0 there is a number 
xo(6) such that, for all « > xo(6), there exists a prime p such that 
xe<p<(14+6)a. 


8.2 Mertens’s Theorems 


We begin by describing two arithmetic functions whose values are loga- 
rithms of primes. 
We define the function ¢(n) by 


Oe logp ifn=p isa prime power, 
~ 10 otherwise. 


Chebyshev’s function 0() is the sum function of the ¢-function, since 


So e(n) = So log p = 0(a). 


n<ux pcx 
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The von Mangoldt function A(n) is defined by 


_ f logp ifn=p* is a prime power, 
Raye { 0 otherwise. 


Chebyshev’s function ¢(x) is the sum function of the von Mangoldt func- 


tion, since 
SoA) = SO logp = ¥(z). 
nse pk <a 
Moreover, 
S > A(d) = log n. 
d\n 


Theorem 8.4 For x > 2, 

x x 
s w (=) = dA(@) [=| = xlogx — x + O(log x). 
Proof. With f(n) = A(n) in Theorem 6.15, we have 


F(a) = >0 A(n) = ¥(2), 


n<u 


and so 


m<u d<a 


- SYa@ 


n<a d\n 
= y logn 
n<ux 


= s«logr—2x+O(logz). 


The last identity comes from Theorem 6.4. 


Theorem 8.5 (Mertens) For x > 1, 


S- A) = log x + O(1) (8.10) 
and 
yo = log x + O(1). (8.11) 


pSu 


8.2 Mertens’s Theorems 
Proof. Since w(a) = O(x) by Chebyshev’s theorem, we have 


tlogx—x2+O(logr) = 


d<a 
(d) 
= aa O(wp(z)) 
A(d) 
get ae 


We obtain equation (8.10) by dividing by z. 
Next, we observe that 


A(n lo lo 
yA ee = ee 


n<ux pcx P 


IA 
M 
re 
3 
iM 
eo 


IA 
egiie 
| 
ols 


< i. 


This proves (8.11). 


Theorem 8.6 


SS, me = logx + O(1). 


Proof. We begin with the convergent series 


&(k) — Sek) . Slogk 
y 7 oS 72 k2 < 00 
k<a k=1 k=1 


By Theorem 6.3 applied to the function f(t) = 1/t?, we have 


Vn L(k 
yee yryrt 


n<ax n<ak<n 
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by Theorem 8.5. 


Theorem 8.7 (Mertens) There exists a constant b, such that 


1 1 
YF =losloge + 61 +0 ( ) 
v) log x 


psx 
for xz > 2. 


Proof. We can write 


1 logp 1 ate. 
yay BP =D Hmawn, 


psa psx 2<n<e 
where : 
“SP ifn=p 
= P y 
A(m) { 0 otherwise, 
and i 
= — fort > 1. 
g(t) oat or t > 
Let ; 
0g Pp 
F®) =) fm =>>—. 
nxt pxt P 


Then F(t) = 0 for t < 2. By Theorem 8.5, 
F(t) =logt+r(t), where r(t) = O(1). 


Therefore, the integral 
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converges absolutely, and 


/ °° r(t)dt 1 

=O : 
2 t(logt)? log x 
By partial aie we obtain 


a = de f(n)g(n) 


= Flea) - f " F(t)g' (bat 
_ loga+r(z) i logt + r(t) 


log x t(log t)? 
1 ed " r(t) 
= 1+0 dt dt 
+ (aes) | tlogt +/ t(log t)? 
°° r(t) 
= logl 1 — log log 2 
og log x + og log +f i(log t)2 


L atoare***° (igs) 


1 
= loglogr+b,+0O (aes) ; 


where 


b; = 1—-loglog2 ——— dt. 12 
1 og log +f ‘ise pee (8.12) 


This completes the proof. 


Theorem 8.8 (Mertens’s formula) There exists a constant y such that 


for x > 2, 
wet 
II (1-=) =e'logr+O(1). 


pSu 


Remark. See Nathanson [2, pp. 162-165] for a proof that y is Euler’s 
constant, constructed in Theorem 6.9. 
: ¢ ; a oe ere oo —k 
Proof. We begin with two observations. First, the series }7,, )i,—2P"/k 
converges, since 


se ee Des 3 <Lagon <* 


p k=2 Pie eee 


Let 
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Second, for x > 2, 


°<) Le < eo = GD 
~ > c 1 as 


IA 


From the Taylor series 
—log(1—t) = y for |t| <1 


and Theorem 8.7 we obtain 


rl 1\7! 1\7! 
log (1 7 ) log (1 = ) 
pSx P dX P 
a 
->YoG 
psa k=1 


- Dilys 


px pyr k=2 


)+e- Dae 


prox k=2 


1 1 
= loglogr+b,+b2+0O (=) +O (=) 


1 
= logl + by + bg + —— }. 
og log x + b; 2 o() 


1 
= loglogr+b,+0O ( 
log x 


Let Die! by + bo. Then 

ty 1 
II (1-=) = e’ log x exp (0 (; )). 
pe a ne 


Since exp(t) = 1+ O(t) for t in any bounded interval [0,to], and since 
O (1/log x) is bounded for « > 2, we have 


-»(0(@h))-1+2(¢5) 
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e’ log x exp (o( : )) 
log x 
1 
ope (140 ( )) 
log x 


e’ logx + O(1). 


Therefore, 


~~ 
i 
| 
Shr 
Sa 
| 
| 


l| 


This is Mertens’s formula. 


Exercises 


1. Prove that 1 *« A = L, or, equivalently, 


> A(d) = logn. 
d\n 


Prove that A = wx L. 
2. Prove that 


3. Prove that 


and use partial summation. 


4. Prove that 


log*p 1 
S- a a log” « + O(log"! «) 
p k 
psu 
for every positive integer k. 


Hint: Use induction on k. 


5. Prove that 


£ x L(n) logplogg 1, 94 
y y 5 log x + O(log x) 


n<ax pqgx Pq 
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6. 


7. 
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Hint: Observe that 
- log p log q = 3 log p S log q 
a> 2G rae. q ” 
pqse psa qsx/p 


and use Mertens’s formula (8.11). 


Prove that 
l ] l 
< nlogn re pq log pq 


Hint: Use partial summation and the previous exercise. 


Prove that 


lim sup om =. 
n 


nm— oo 


Hint: Use Exercise 12 in Section 7.3. 


8.3 The Number of Prime Divisors of an Integer 


The arithmetic function w(n) counts the number of distinct prime divisors 
of the positive integer n, that is, 


We have 


pin 
wl) = 0, w(6) = 2, 
w(2) = 1, w(7) = 1, 
w(3) = 1, w(8) = 1, 
w(4) = 1, w(9) = 1, 
w(5) = 1 w(10) = 2 


The arithmetic function Q(n) counts the total number of primes whose 
product is n, that is, 


pr ||n 
We have 
Qa) = 0, Q(6) = 2, 
Q(2) = 1, Q(7) = 1, 
Q(3) = 1, Q(8) = 3, 
Q(4) = 2, Q9) = 2, 
Q(5) = 1 Q(10) = 2 
If 


N=Py Po 77+ Dy 
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is the standard factorization of n as a product of powers of distinct primes, 
then 
w(n)=k 
and 
Q(n) =ryt rete + +7rp. 

We shall prove that almost all integers up to x have log log x distinct prime 
factors. We begin with estimates for the mean value and mean-squared 
value of w(n) 


Theorem 8.9 For x > 2, 


S| w(n) = vlogloge + bie +0 ( as ) , 
log x 


n<u 
where b, is the positive real number defined by (8.12). 


Proof. Applying Chebyshev’s theorem (Theorem 8.2) and Mertens’s the- 
orem (Theorem 8.7), we obtain 


Som = OY-YyYs 


n<ax n<a pin psx a 
= ¥[2]-Dz+o@e 
pSx 7 pxa P 
1 4 
2 eye ( ) 
pemey log x 


1 x 
= — (iogtog.e +b +O (=) +O (+) 


= vlogloge + bie +O ( = ) 
log 


8 


Theorem 8.10 For x > 2, 
S- w(n)? = «(log log x)? + O(x log log x). 
n<ux 


Proof. We have 
2 


wey =| S51) S/S 2S 4 


pin piln pain 
= S- 1+ S- l= Se 1+w(n). 
P1p2|n p\n pip2|n 


P1FP2 P1FP2 
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By Theorem 8.9, 


oY 14+ Dew 


n<x n<ax pipeg|n n<u 
P1FP2 


= S- Ss 1+ clog log x + O(2) 


pypgsev n<ex 
P1FP2 P1p2|nr 


- » |] + O(etogtog 2) 


Pip2 


M 
3 
= 
l 


P1P2Z<e 
P1FP2 


= S- oes GG ys 1 | + O(a log log x) 


Pip2 


P1P2Z<e P1p2Se 
P\AP2 P\FP2 
1 
=2 y —— + O(a log log x), 
P1P2<@ P1p2 
P1FP2 


since, by the Fundamental Theorem of Arithmetic, there are at most 2” 
ordered pairs (pi, p2) of distinct primes such that pyp2 < x. From Theo- 
rem 8.7, we obtain 


2 


=e y 


P1p2<e P1p2 psx i 
P1FP2 
(log log x + O(1))? 
= (loglog x)? + O(log log x) 
and 
2 
1 1 1 
P1p2 = » p 7 d pe 
ed PSV PSV 
P1FP2 
= (loglog vz + O(1))? + O(1) 
= (loglog x)? + O(log log x). 
Therefore, 


ys w(n)? = «(log log x)? + O(x log log x). 


n<ux 


This completes the proof. 


We also need the following result, which is essentially Chebyshev’s in- 
equality in probability theory. 
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Theorem 8.11 (Chebyshev’s inequality) Let S be a finite set of inte- 
gers, and let f be a real-valued function defined on S. Let uw and t be real 
numbers with t > 0. Then the number of integers n € S' such that 


|f(n) -pl >t 
does not exceed 


Dl F(n) ~ 1) 


nes 


Proof. If | f(n) — y| > t, then 


and 


card{ne S:|f(n)—pl] >t} = S- 1 
nes 
[f(n)=nl>¢ 


so (f(r) = #) 


t2 


IA 


nes 
lf(r)-nl2t 


oD (F(n) - u). 


nes 


IA 


Now we prove that w(n) has “normal order” log logn in the sense that 
w(n) is close to log log n for almost all n. 


Theorem 8.12 (Hardy—Ramanujan) For every 6 > 0, the number of 
integers n < «x such that 


|w(n) — log log n| > (log log x) até 
is o(a). 
Proof. (Turan [143]) Let S' be the set of positive integers n not exceeding 
x, f(n) = w(n), and uw = loglogaz. Applying Chebyshev’s inequality, we 


see that for any t > 0, the number of integers n < x such that |w(n) — 
log log z| > t is at most 


2 S"(w(n) — log log x)”. 


n<ux 
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We use Theorem8.9 and Theorem8.10 to evaluate this sum as follows: 


SS" (w(n) — log log x)? 


nN<x 


= S- w(n)? — 2log log x S- w(n) + S| (log log x)? 


nx n<a nsx 

= x(loglog x)? + O(xlog log x) — 2 log log x(a log log a + O(x)) 
+ a(log log x)? + O((log log x)?) 

= O(axloglogz). 


Let 6 > 0 and t = (log log z)2+® — 1. Then 
#? > (loglogx)!+?6 — 2(log log x)? *¢ 


= (loglogx)'*° ((log log x)° — 2(log log n)-V/?) 
> (loglog x)!** 
for « sufficiently large. Therefore, if 


T= {ne€ S: |w(n) — log log a| > (log log x)2+6 ze 


then 
log] 
IT] « x log log x : 
((log log x)2+ — 1) 
x log log x 
(log log x)!+¢ 
_ xv 
~ (log log x)é 
= o(2). 
Let x > e°. If 
gle<cn< x, 
then 
0 < loglogx — 1 < loglogn < log log x. 
If 7 
w(n) — log log n| > (log log x)2+°, 
|u(m) — log log g log 
then 


|w(n) —loglogz| > |w(n) — log log n| — | log log x — log log n| 
> (loglogx)?+® —1 
t. 


I 
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Therefore, if 
U={ne€S: |w(n) — loglog n| > (log log a)2t}. 


then U C T and so 
|U| < #1/¢ + |T| = o(2). 


This completes the proof. 


Exercises 
1. Compute w(n) and Q(n) for 11 <n < 20. 


2. Prove that there exists a constant b3 such that for x > 2, 


1 1 
oe = log log z + b3 +O (=) : 


8.4 Notes 


There are many beautiful open problems about prime numbers. Here are 
some examples. 


1. Do there exist infinitely many primes p of the form p= n? + 1. For 
example, 5 = 2? +1, 17 = 4? 4+1, and 101 = 10? + 1. The best result 
is due to Iwaniec [73], who proved that there exist infinitely many 
integers n such that n? +1 is either prime or the product of two 
primes. 


2. The twin prime conjecture states that there exist infinitely many 
primes p such that p+2 is also prime. For example, {11, 13}, {29,31}, 
and {101,103} are twin primes. 


3. The Goldbach conjecture states that every even number n > 4 can be 
written as the sum of two primes. For example, 4 = 2+ 2,8 = 3+5, 
and 100 = 17 + 83. 


4. A polynomial f(t) with integer coefficients has prime divisor p if p 
divides f(t) for every integer t. We say that f(t) represents a prime 
p if there is an integer n such that f(n) = p. Dirichlet’s theorem 
(Theorem 10.9) states that if m and a are relatively prime integers 
with m > 1, then the polynomial f(t) = mt + a represents infinitely 
many primes. These linear polynomials are the only polynomials that 
are known to represent infinitely many primes. 
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It is conjectured that if f(t) is any irreducible polynomial with integer 
coefficients and positive leading coefficient, and if f(t) has no prime 
divisor, then the polynomial f(t) represents infinitely many primes. 


An even more general conjecture, called Schinzel’s Hypothesis H [124, 
125], states that if fi(t),..., f(t) are irreducible polynomials with 
positive leading coefficients, and if the polynomial f,(t)--- f-(¢) has 
no prime divisor, then there exist infinitely many n such that the 
r numbers f)(n),..., f,(m) are simultaneously prime. Many classi- 
cal problems are special cases of this conjecture. For example, the 
problem about primes of the form n? +1 is the case r = 1 and 
fi(t) =t? +1. The twin prime conjecture is the case r = 2, f(t) = t, 
and fo(t) => t+ 2 


A conjecture of Schinzel and Sierpiriski [125] asserts that every pos- 
itive rational number x can be represented as a quotient of shifted 
primes, that is, c = (p+ 1)/(q+ 1) for primes p and gq. It is known 
that the set of shifted primes {p+ 1: p € P} generates a subgroup 
of the multiplicative group of positive rational numbers of index at 
most 3 (Elliott [30]). 


Let fi(t),..., f(t) be irreducible polynomials with integer coeffi- 
cients and positive leading coefficients. Let g(t) be a polynomial with 
integer coefficients. Suppose that there exist infinitely many posi- 
tive integers N such that N — g(t) is irreducible and the product 
fi(t)--- f-(Q)(N — g(t)) has no prime divisor. Schinzel’s Hypothesis 
Hy asserts that if N is sufficiently large, then there exists an integer n 
such that N—g(n) is prime and f;(n) is prime for alli = 1,...,7r. The 
Goldbach conjecture is the special case when N is even, r = 1 and 
fi(t) = g(t) =t. Note that if N is odd, then fi (t)(N—g(t)) = t(N-t) 
has the prime divisor 2. 


Do there exist arbitrarily long finite arithmetic progressions of primes? 
Erdés asked the following more general question: If A is an infinite 
set of positive integers such that the series }),¢ 4 a—' diverges, then 
must A contain arbitrarily long finite arithmetic progressions? If the 
answer is yes, this would immediately imply the existence of long 
arithmetic progressions of prime numbers, since ee p | diverges 
(Theorem 8.7). 


All these conjectures are still open, but important techniques, espe- 
cially sieve methods and the circle method, have been developed to attack 
them, and some deep results have been obtained. More information can 
be found in the following books: Halberstam and Richert’s Sieve Meth- 
ods [47], Nathanson’s Additive Number Theory: The Classical Bases [2], 
and Vaughan’s The Hardy-Littlewood Method (148). 


9 


The Prime Number Theorem 


9.1 Generalized Von Mangoldt Functions 


The function m(a) counts the number of prime numbers not exceeding «. 
Euclid proved that lim, ,.. 7(2) = oo. The prime number theorem (PNT), 
conjectured independently around 1800 by Gauss and Legendre, states that 
m(a) is asymptotic to x/log x, that is, 

on m(x) log 


w— 00 £ 


=: 


In this chapter we shall give an elementary proof of this theorem, where 
“elementary” means that we do not use contour integrals, Cauchy’s the- 
orem, or other results from analytic function theory, but only basic facts 
about arithmetic functions and the distribution of prime numbers that we 
proved in Chapters 6 and 8. 

Recall that the von Mangoldt function A(n) is equal to logp if n is a 
positive power of the prime p, and 0 otherwise. Let L(n) = logn. Then 


L=1xA, 

where 1(n) = 1 for all n. By Mobius inversion, we have 
A=pxlL, 

and so 


A(n) = (w*L)(n) 
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= Sould)L(n/d) 
d| 
= Ln) Soul) — So ula) Ld) 
d\n 


d\n 


= -JSopl(d)L(d). 
dl 


The divisor function d(n) counts the number of positive divisors of n. Since 
d= 11, from Mobius inversion we obtain 1 = yw * d, and so 


A-1l=p*xLbL—-pxed=px(L—d). 


For every nonnegative integer r we define the generalized von Mangoldt 
function A, by 
Ay = pb * EF 


Then Ag = wx 1 = 6, and Ay = wx L = A is the usual von Mangoldt 
function. The elementary proof of the prime number theorem makes use of 
the generalized von Mangoldt function Az. We have 


Ao(1) = 0, Ao(6) => 2 log 2 log 3, 
Ao(2) = log?2, A2(7) = log?7, 
A2(3) = 1log?3, Ao(8) = 5log?2, 
Ao(4) = 3log?2, Ax(9) = 3log?3, 
Ao(5) = log’5, A2(10) = 2log2log5. 


Theorem 9.1 For every positive integer n, 
Ag(n) = A(n) logn + A * A(n). 


Proof. Recall that pointwise multiplication by the logarithm function 
L(n) is a derivation on the ring of arithmetic functions (Theorem 6.2). 
Multiplying the identity LD = 1* A by L, we obtain 


i? = Tek 
= L-(1*A) 
= 1x(L-A)+(L-1)*A 
= Lx(A-L)+LD*«A. 


Therefore, 


Ag = pe LD? =pale(A-L)+p*eLxeA=A-L+AxA, 


which is the formula we want. 
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We can compute the function Ag = * L? explicitly. Let w(n) denote the 
number of distinct prime divisors of n. If w(n) = 0, then n = 1 and 


If w(n) = 1, then n = p*, where p is prime, k is a positive integer, and so 


Ao(p*) u(1)L?(p*) + pu(p)L? (p*-*) 
(k log p)? — ((k — 1) log p)? 
2a 


(2k — 1) log 


I 


l 


If w(n) = 2, then n = p*q‘, where p and q are distinct primes, k and @ are 
positive integers, and 


Ao(p'g’) = w(A)L*(p¥q") + (pL? (p**4°) + wg) L? (pg) 
+ p(pq)L?(p**q°*) 
= L? (p¥q°) -_ Lp" 19°) _ Cipro 2) + L?(p*-1q°1) 
= 2logplogg. 


Let w(n) > 3. 1fn = dk, then either d or k is divisible by at least two distinct 
primes, and so A(d)A(k) = 0. Moreover, A(n) = 0. Applying Theorem 9.1, 
we have 
Ao(n) = L(n)A(n) + S> A(d)A(k) = 0. 
dk=n 

The support of an arithmetic function f(n) is the set of all positive inte- 
gers n such that f(n) 4 0. We have just shown that the support of Ag(n) 
is the set of all integers n with w(n) = 1 or 2. 


Exercises 
1. Compute A2(30) directly from the definition Ag(n) = py * L?. 


2. Prove that 
Ax A= —pL x L. 


3. Prove that 
BP=L?*«A+2L*ITA+1% LA 


and 
Ag = Ao *A+ Lo. 


Prove that the support of Ags is the set of all integers n such that 
1<wu(n) <3. 
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4. Prove that 


r 


pHs »: @ L'-* x LEA 


k=0 
for all r > 0. 
Hint: Use L = 1* A and Exercise 6 in Section 6.1. 


5. Prove that 
Aya, = DA, +Ax* A, 


for allr > 0. 


6. Let r > 1. Prove that the support of A, is the set of all positive 
integers n such that 1 < w(n) <r. 


7. For a positive number x and positive integers d and n, define 


Ad) = Ax(d) = w(d) log? = 
and 
A(n) = O,(n) = 1* A(n) = S~ ACA) 
d|n 
Prove that: 
(i) 


(ii) If u > 1, then 
A(p") = log plog —. 
Pp 
(iii) If u,v > 1, then 
A(p“q") = 2 log plog q. 
(iv) If m is the product of the distinct primes dividing n, then 


O(n) = O(m). 


(v) If n is square-free and p divides n, then 


riven (S) tl). 


(vi) If n is divisible by three or more primes, then 
O(n) = 0. 


Hint: Reduce to the case of square-free integers n, and use in- 
duction on the number of prime factors of n. 
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9.2 Selberg’s Formulae 


The elementary proof of the prime number theorem begins with a formula 
of Atle Selberg for a sum over products of primes not exceeding x. We give 
several versions of this formula. 


Theorem 9.2 (Selberg’s formula) For x > 1, the mean value of the 
generalized von Mangoldt function Ag is 


S © Ao(n) = 2x log x + O(2). (9.1) 


n<ux 


Proof. We begin with a computation that uses the estimates in Theo- 
rems 6.9, 6.11, 6.12, and 6.16. 


Yan) = ean) 


na na“ 
= SO u(d)log? k 
dk<a 
= Sond) S> log*k 
d<ax k<a/d 
2 
= Sao (; (log =) 7 log | 7 +0 (tox? 5) ) 
= 2 HO tog (log = 2) 2 yo 40 Sol g? 
d<a d<a d<a 
= 2° Bg) log = (log = 2) O(x) 
d<a 
= 2 HO tog S- ~-;-2+0(5) + O(z) 
d<au m<a/d 
= MO og S- = =—(y+2)e HO og + 02) 
d<a m<a/d d<a 


We estimate these two sums separately. The first sum gives the main term 
in Selberg’s formula: 


d<a m<a/d 
= u(d) 
dm d 
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x) >= a Hee )log = 


I 


eae 
- a ~~ u(d) — PS ~ > (a) log d 
n<au d\n n<x d\n 
= wlogr+2 S- a) 
n<u 


= 2«logr+O(z), 


by Mertens’s formula (8.10). Finally, using Theorem 6.16, we obtain 


d x d 1 d 
yes = DAD ~-,+0(S) 


d<a d<a m<a/d 
(d) (d) 
= ar Md 
2. Oa): 


This completes the proof. 


Notation. By ae Wea denote the sum over all ordered pairs of primes 
(p,q) such that pq < x. For example, 


S © log plog q = log 2log2 + log 2log3 + log 3 log 2 
pqs8 


= log?2+4+ 2log2log3. 


In the elementary proof of the prime number theorem we shall use the 
following equivalent forms of Theorem 9.2. 


Theorem 9.3 (Selberg’s formulae) For x > 1, 


S- log? p + S- logplogg = 2xlogx+ O(a), (9.2) 
pcx pqcux 
O(x) log x + a logp 0 (=) = 2wlogx+O(z), (9.3) 
psu x 
log plog q q x 
1 = 62 ———— }. A 
doloer+ Dy log pq | cm 1+ log x ee) 


psa pqcx 
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Proof. By Theorem 9.1, 


S5 Ao(n) = $5 A(n) logn + $7 A* A(n). 


n<x n<a na 


We consider the last two sums separately. The first sum is 


S- A(n) logn = Slog? p + Des k log? p. 


nsx psx pk <a 
k>2 


If p* < # and k > 2, then p < \/z, and so 


[ies] 
Y klos’y = YD boety DS 
Pk Se pSVa k=2 
log x \? 
Ss. ioe? v( = ) 
log p 
PSVva 
< VJalog’x 
<< &. 


Therefore, 


Ss" A(n) logn = s log? p + O(2). 


n<ux pcx 


For the second sum, we have 


ST Ae A(r) = SYS) SS AWA) 


n<a n<ax2 n=uv 
= S© lgplogq 
pkql<a 
k,e>1 
= S— log plog q + S- log p log q. 
pqsa pk qt<a 


We apply Chebyshev’s theorem to estimate the remainder term. 


S5 logplogg < > logplogg+ S > logplogg 
pkql<e pkq<a pkqt<a 
k+l>3 k>2 e>2 
k,e>1 e>1 k>1 


2 S- log p log q 


pkql<a 
k>2 


e>1 
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= 25° logp S- log q 


pk <a qe<a/pk 
aces e>1 


= 2 rosy v (5 =) 


pk <a 
ces 


& ss nlogP 


pk <a P 
k>2 


< Sloe oe 


pcx 


«< «)- la 


<< f. 


Therefore, 


$7 A* A(n) = ye log plog ¢ + O(2). 


n<x pqgax 
It follows from Theorem 9.2 that 


S7Ao(n) = S > A(n)logn+ SA * A(n) 


nsx n<a“ na 
- S| log? p + Se log plog q + O(a) 
psx pqsa 


= 22wlogr+O(z). 
This proves (9.2). 
Recall the arithmetic function 
i= { log n if n is prime, and 


0 otherwise. 


We have U(x) = YO, <,, €(n), where 0(x)/x = O(1) by Chebyshev’s theorem. 
Applying partial summation, we have 


S- log’ p = ye €(n) log n 
psx na 
” Ot 
= v(x) logx -f OO a 
1 
= v(x) logx + O(2). 


Also, 


> log plogq = ) > log p SS loge = tog 0 ( ). 


pqcx pcx q<a/p pcx 
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Inserting these two identities into (9.2), we obtain (9.3). 
Consider the function f(n) = &(n) logn + £ * &(n). We can restate for- 
mula (9.2) as follows: 


nx 

= S-(e(n) logn + € x &(n)) 
n<ax 

= S (log? p + a log p log q 
psa pqsa 


= 2wlogx+O(z). 


Also, F(x) = 0 for x < 2. Applying partial summation, we obtain 


Slog p + 3 logplogq _ s £(n) logn + £ * &(n) 


psx pqcx dogg Pq 2<n<a log ia 
- ve 
eee 
F “ F(t 
_ Fle) op Oy 
log x 2 tlog’t 
_ 2xlogx + O(a) if: 2tlogt + Olt) 4, 
— log x 9 tlog? t 
= 27+0O (= ) ; 
log x 
by Exercise 1. If x > e, then 
x 22 
< 9 
logx ~ 1+logaz 
and so 
x x 
s () =0 (; —) 
Ifl<a<e, then 
x 
1+loga ~ 
and 1 1 
0< S/ log p + se pew < log 2, 
2 z, logpq- 
psa pqs 
and so 
log plog q 
- ] 1< ——. 
‘a dilosp+ d_ log pq — = lone 


psa pqcax 
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This completes the proof of (9.3). 


Exercises 


1. Let « > 2 and k > 1. Use integration by parts to prove that 
| dt 0, 2 ie | %. dt 
2 log” fie log* z log” 2 2 fogs t 


Prove that 
ie ee O ( x ) 
2 log*t! t 7 los’? a)’ 


where the implied constant depends on k. 


Hint: Divide the interval of integration [2,2] into two subintervals 
(2, Ja] and [Yz, 2]. 


2. Let « > 2 and n> 1. The logarithmic integral is the function 


Prove that 
“. (k-1)! 
We) = > <P 60, (—S-), 
ho, «(0g log’ "x 

where the implied constant depends on n. 
Prove that * 

li(a) ~ : 

az) log x 


3. Show that formula (9.4) implies formula (9.3). 


4. Define the positive real numbers A and a by 


o(@) 


lim sup —— =A 
x~— 00 x 
and Fs 
lim inf ae) =a 
w—- CO x 


Observe that a < A and that the prime number theorem is equivalent 
to the statement that A = a = 1. Use Selberg’s formula (9.3) to prove 
that 

At+ta< 2. 


Hint: Note that 0(x) > (a —)« for all x sufficiently large. Choose a 
sequence of real numbers x; such that x; goes to infinity and 0(a;) > 
(A — ¢€)a; for x; sufficiently large. Use Theorem 8.5. 
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5. Use Selberg’s formula (9.3) to prove that 
A+a>2. 


Conclude that A+ a = 2, and that the prime number theorem is 
equivalent to A= a. 


9.3 The Elementary Proof 
We define the remainder term R(x) for Chebyshev’s function 0(a) by 
R(x) = V(x) — 2. 


We shall prove the prime number theorem in the form (x) ~ a, or, 
equivalently, R(a) = o(a). More precisely, we shall prove that there ex- 
ist sequences of positive real numbers {6,}°°_, and {um }°°_, such that 
limyn—sco Om = O and 


|R(x)| < bmx for all z > Um. 


The argument is technically elementary, but delicate. 
We need the following estimate. 


Lemma 9.1 For x>e, 


l 
ee < log log x. 


Proof. By Mertens’s theorem (Theorem 8.5), for every positive integer 
j we have 


SP (log "5 + 0(1)) - (log = + 011) = O11). 


Pp J 
Fs os 
7 SPS Gat 


Moreover, if 


x x 
poe pial 
then 
j<1+log— <j+l, 
Pp 
and so 


l 1 l 1 
Og Pp a S- Og Pp < 


= pe gy P (1 + log 2) Hiiepe ee A 
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Therefore, 
log z]+1 
a, SS ons NOE St 
pax P (1 + log «) j=l S<pssh p (1+ log «) 
log z]+1 1 
« Ys 
jar 4 
< loglogs. 


This completes the proof. 


Theorem 9.4 For x> 1, 
|R(x)| < 1 x |R(2)|+0 x log log x 
x a. ae ean 
~ log a n log x 


Proof. Replacing 3(x) by «+ R(x) in Selberg’s formula (9.3), we obtain 


2zlogx+O(r) = H(2)1og.2-+ F togp 0 (=) 


pSu 


= (e+ Rta) tae + Toee (E+"(2)) 


pSu 


1 
= vlogr + R(x)logr +a )~ Euee +S°R (=) log p 


pXau pKu 


= R(x)logx+ S- R (<) log p + 22 log x + O(a). 
psu 
This gives 
R(x) loga = — SR (=) log p + O(2). (9.5) 
psu 


We denote prime numbers by p,q, and r. Let p < x. From (9.4) we have 


l l 2 
S- log q S- =e = fy x LO x 
qsx/p qr<a/p ae . Pp (1 + log “) 


Then 


S¢ logplogg = S logp S© logq 


pqsx psx q<x/p 
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log p log plog qlogr 
— 22 ——— — SEER NEESER 
py 


log gr 
pXu parca o¢ 


log p 
p<« P (1 + log “) 


l l 
= 2e(loge +O(1))— D> PPEPET S~ dog p 


l 
ae psa/qr 


+Ola2 


qr<ax 
log p 


p< P (1 + log ) 


log glogr x 
= 2xlogx Se, naar 0 a + O(« log log x), 


+Ola2 


qr<ax 


where the error term comes from Lemma 9.1. Inserting this expression for 
pg<2 10g Plog q into Selberg’s formula (9.2), we obtain 


] ] 
S/ log” p = S- aS aul (=) + O(a log log z). 


a Ee log pq pq 
Therefore, 
log plog q (=) 
U(x) log x = ————- 9 | — ]} + O(a log log z). 9.6 
(x) log oy epi oe (x log log x) (9.6) 


Replacing 0(x) by « + R(x) in (9.6), we obtain 


werioet (= (=)) 
a+ R(x))logxr = +R + O(2 log log x 
(x + R(2x)) log by er tas a (x log log x) 


log pl log pl 
= gS PERE Ss cee IR (=) 


pace PUlEPA Xi, lee pa pq 


+ O(« log log x). 
By Exercise 6 in Section 8.2, 


S log plog q 


= log x + O(log log x), 
Bi loa g (log log x) 


pqgx 


and so 


l ] 
R(x) log x = S- ese (=) + O(2 log log x). (9.7) 
pace 1O8PY pq 
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Adding formulas (9.5) and (9.7), we obtain 


log p1| 
Semerle 5 )|+ 2a wae Ape 
ron p/\| “2, logpa pq 


+ O(a log log x) 


2|R(x)| log x 


IA 


Lx &(n) 


= Sela + Se) 
a O(a log log x) - 
= S- (410) | a) |r (=)| + O(x log log x). 


n<ux 


We can write the partial summation formula (6.6) with a = 0 and 6b = [2 
as follows: 


Yo f(r)g(r) = YO F(m)(g(m) - g(n + 1)) + F(@) g(a). 


Let t+ 0n) 
Flin) =n) + EO 
and g(n) = |R(a/n)|. By Selberg’s formula (9.4), 
_ Lx l(n)\ _ x 
Fa)= >) fm) = s; («1n) + ee ) =22+0 (is) 


na nx 


> (e+ EO) 


= (+0 (rgs)) (POI) 


n<a-1l 


* 2+0(crtes)) |e) | 


We evaluate these terms separately. The main term is 


2 > »(le@-|e(a)) 


n<a-l 


= 25 nfe[-2 Es 


lI 
i) 
3 
ms) 
——™~ 
wales 
| 
i) 
| | 
| M | 
= 
| 
a 
ae) 
o— 
3/8 
NS 
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aC) 


= 25 |n()]-20 
- 2D |n(D] +00 


since 1 < x/[a] < 2 for all x > 1, and so V(x/[z]) = 0 and R(a/[x]) = O(1). 
To evaluate the second term, we begin by observing that 


(|e aaa) 


| 
S 
“~~ 
3/18 
NY 
3/8 


lA 
S 
MS 
318 
le” 
S 
| a ry aa 
3 
S 
ae 
NW 
——N 
S18 
3 
tle 
Ae 
— 


x x x x 
S o(5) u .)), n n+l 
£ x x x 
~ 0(5) 0(—)+5 n+1 
x x x 
0(5) 0(—2) +4. 
Therefore, 
n x x 
a (<5) (|r (2) R (5)) 
n x x 
s (ste) (0(2) o()) 
ne 25 (ve oen’ 
We have 


E, Gigs) (°@ -* a) 

&, (rhe) ie a) 
MON Ga sen) 3) 
POOd Ccorrarre sac 


= 1s A sax)? (2) 


IA 
S 
~~ 
= 
\ 


eR 


304 9. The Prime Number Theorem 


1 
K C+2zr aT EEE 
2 n(1 + log n) 


< z«loglogz, 
since i 
——— = O(log] 
ae n(1+ log n) \log lesz) 


by Exercise 11 of Section 6.2. 
The third term is simply 


(29+ (raras)) P(E) [= 


Combining these results, we obtain 


2|R(x)|loga < 2 y, IR (=)| + O(a log log x). 
n<u 


Dividing this inequality by 2 log # completes the proof of Theorem 9.4. 


Lemma 9.2 Let 0 < 6 < 1. There exist numbers co > 1 and x1(6) > 4 
such that if x > x71(6), then there exists an integer n such that 


a<n< e/a 


and 
|R(n)| < én. 


The constant co does not depend on 6. 
Proof. By Theorem 6.9, 
1 
> —=logr+y7+r(x) = logx+ O(1), 
n<ux M 
where |r(a)| < 1/x. If 1 <a <2’, then 
1 / 
d= = log = + 1'(2), 
n x 
e<n<a’ 
where |7r’(x)| < 2/x. By Theorem 8.6, 


S- _ = log x + O(1). 


n<ux 
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Then 
Rin V(n)—n 
ys a = pe A 
7 = (loge + O(1)) — (log + O(1)) 
= “OGL: 
and so 


e<n<a’ 


for all 1 < x < x’. Choose cp > 1 such that 


S- Rw << (9.8) 


for alll <a <a’. 

Let 06 < 1 and p = e®/*. Then px > ex. Choose 21(6) > 4 such that 
logx < 6x for all x > 2,(6). We must prove that if « > x,(6), then there 
exists an integer n € (2, px] with |R(n)| < 6n. There are two cases. 

In the first case, we assume that either R(n) > 0 for all integers n € 
(x, px], or R(n) < 0 for all integers n € (x, px]. Then 


S- ae) = S- CO S- ES & 


e<n<px Lr<n<px r<n<px 
If 
R 
m* = min { (n)] Ine (xpa}, 
nr 
then 
R 1 
© ( zee) 2 
n n 
r<n<px 
: 1 
=m = 
nm 
Lr<n<px 
2 
> m (108 ee =| 
x ax 
x [ Co 1 
> ance 
as ( 6 >) 
Com* 
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and so 
0<m* <6. 


There exists an integer n € (a, px] with |R(n)|/n = m*, and so 
|R(n)| < én. 


In the second case, there exist integers n — 1 and n in the interval (a, pz] 
such that R(n — 1) 4 R(n) and R(n — 1)R(n) < 0. Moreover, n-1> a> 
x1(6) > 4, and so n > 6. For every integer n > 2 we have 


R(n) -— R(n-1) = VWn)-V(n-1)-1 
= logn — 1 if n is prime, 
7 —1 if n is composite. 


It follows that if R(n) < R(n — 1), then R(n) — R(n — 1) = —-1. Since 
R(n) <0 < R(n—1), we have |R(n)| < 1 < logn < 6n. If R(n—1) < Rin), 
then R(n — 1) <0 < R(n) and 


0< R(n) < R(n) — R(n— 1) = logn—-1 < logn < én. 


In all cases, there exists an integer n € (x, px] such that |R(n)| < 6n. This 
completes the proof. 


Lemma 9.3 Let co > 1 be the number constructed in Lemma 9.2. and let 
0<6< 1. There exists a number x2(6) such that if x > x2(5), then the 
interval (a,e/*x] contains a subinterval (y, e°/?y] such that 


|R(t)| < 46t 
for all t € (y, e8/y]. 
Proof. We begin with Selberg’s formula in the form (9.4). For x > 1, 


l l 
Slogp+ >> nou wet =20+0( +). 


psx pqsax log pq Pd 


For 1 < u<t we have 


0 < SO logp 


u<pct 
log plog q 
< ¥ bep+ yo Meeeea 
u<pct u<pqct 


t u 
Sra ee (<a) +o( eS) 


= xe-u) +0(— 5), 
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since the function t/(1 + logt) is increasing for t > 1. Moreover, 
S- log p = W(t) — 0(u) =t—ut R(t) — R(u), 
u<pct 


and so 


H(i eR Ba Se u+o( : 


It follows that if 1 <u < t, then 


t t 
_ < | < | 5 
|R(t) — R(u)| <t—u O(a) <t-u 0() 


Ifl<t<u< 2t, then 


|R(t) — R(w)| 


IA 
S 


u 
t+O 
(es) 
2t 
< —al+t i 
< |f-a o(5a) 


t 
< |t-u+0(4). 
logt 


In particular, if wu > 4 and t/2 < u < 2t, then 


IR(t)| < [R(u)| + |t— ul +O (=) | 
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(9.9) 


By Lemma 9.2, there is a number co > 1 such that if 0 < 6 < 1 and 


x > %1(6) > 4, then there exists an integer 
ne (a,<%/*s 


with 
|R(n)| < én. 


If t is a real number in the interval [n/2,2n], then t/2 < n < 2t. Since 


n >a > 4, we have 
logt > log(n/2) > log(«/2) > (log «)/2, 


and 


|R(t)| 


IA 


t 


< in+|t—n) +0 (<4) 
log x 


tee le) 
t n log x 


t 
¢(26-+ [5-1] + = ) 
n log x 


IA 
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for some constant cz > 0. If x > x2(6) = max (71(6), e°/°), then 
t 
IR(t)| <t(36+)——-1)). 
n 
Choose ¢ in the interval 
e 4/2 <t< 4/2, 


Then t € (n/2,2n) since e°/? < e!/? < 2. If t/n > 1, then 


> 


t t 
+a 1]=2-1<e—-1.<5 
n n 


since e6/? <1+6 for0<6 <1. Ift/n <1, then 


Sj agp te er aT ay, 


Therefore, 
|R(t)| < 46¢. 


We define the number y as follows. If e6/?n < e°/*x, let y = n. If 
€4/2n > e/a, let y = e~*/2n. Then 


y =e 8/2 > eS eS = el0/9)-Sy > x, 
since cg /d > cg > 1 > 6. In both cases, 


(ye? yl C @, e/a] 


and |R(t)| < 46t for all t € (y, e°/?y]. This completes the proof. 


Theorem 9.5 (Prime number theorem) For Chebyshev’s function 0(2), 
Our)~a 
as L—> OO. 


Proof. By Theorem 8.1, 


jintanp = imap —1<log4-1<04. 
x 


L000 x L000 


By Theorem 8.2, 


v 
lim inf AY), = lim inf ve) —1>log2—-1> -—-0.4. 
«LOO ve mw CO x 
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It follows that there exist numbers M and wu, such that 
|R(z)| < Mx for all x > 1, 


and 
|R(x)|< 6:2 for alla > wz, 


where 
6, = 0.4. 


We shall construct sequences of positive real numbers {6,,}°°_, and 
{Em }o°_,, such that 
6, > 62 > 63 >:-- 


and 
lim é€m = 0. (9.10) 


m— co 


Let m > 1, and suppose that we have constructed the number 6,,. Let 
Co > 1 be the number defined in Lemma 9.2. Choose ¢,,, such that 


0< em <1/m 


and 
62 1 
1 m)(l1—-—@ 
Soe ( ts) < 
We define 
on 6 9.11 
bm, =(1 m 1-—”". m: 5 
11 = (1+ em) (1- Pe (9.11) 


Then 0 < 641 < 6m. This determines the sequences {6,,, }°°_, and {€n,}°_1 
inductively. 
We shall prove that for every m there exists a number u,,, such that 


|R(x)|< 6,0 for all x > Um. (9.12) 


Let us show that this suffices to prove the prime number theorem. The 
sequence {6,,, }°°_, is a strictly decreasing sequence of positive real numbers, 
so the sequence converges to some nonnegative number 6 < 1. Then (9.10) 


and (9.11) imply that 
62 
— 1 — —— — 


Inequality (9.12) implies that R(x) = o(a), which is equivalent to the prime 
number theorem. 

We construct the numbers u,, inductively. There exists u; such that 
|R(x)| < 6,” for x > u;. Suppose that u,, has been determined. We shall 
prove that there exists a number um4+41 such that |R(x)| < é6m412 for all 
x = Um+1- 
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Define 5 
= 
we 8 
and 
p= 00/8 . 


Let 22(6/,) be the number constructed in Lemma 9.3, and let 
x3(m) = max (22(6),,), Um) - 


If 
x > 23(m) > x2(6,,), 


then by Lemma 9.3, every interval (x, pz] contains a subinterval (v. com/ *y| 
such that fi 
|R(t)| < 46,,¢ = aoe 


for all t € y,ebu/y] . Let k be the greatest integer such that p* < 
x/x3(m). Then 
1 
k< SBP) oops Ly 
log p 


and so 


log p 
pikes 


= 1 . 
Sa 


By Theorem 9.4, 


IR) < > |R(2)| +000) 


log x “=, 
1 Z 1 x 
eg sl) age sa) 
log x d n log x = n ree) 
n<p pe <n<a 
1 x Mea 1 
Ze R(=) vs = + o(z) 
log x >» n log x oe, n 
1 x 
s EE II +0 
~  logar oz n om) 


since 


BS Sg =low(pes(m)) + O(1/2) = 010). 


pk<n<a xz/(px3(m))<n<ax 


9.3 The Elementary Proof 


Ifl<n< p*, then 
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= > = > 23(m) > 
—> = >23(m fn 
i pe ate 
and 
(| <A 
n n 
by the definition of u,,. 
For j = 1,...,k, we have 
x x 
7 2 —_ 2 a3(m) 2 w3(6,,); 
pi ~ pk 
and so each interval (4. P| contains a subinterval J; = (ws, efin/2y| 
such that 
mt 
IR(t)| < 45',t = nt fee eer: 
Therefore, 
x x 
DEEN a5 Dep cll hg) Oa eta) 
n€(pi-1,p4] ne (pI~1,pF]\I; nel; 
1 Om x 1 
n€(pI~1 ,pI]\I5 nel; 
1 bmx 1 
PARE aor Dra 
n€ (pI~1, p94] nel; 
Then 
x E x 
ee See oe) 
n<p* J=1 ne (pI—1,p7] 
k 
1 bmx 1 
< 6 — — — = 
matty [one Dy a doe 
j=l n€(pi-1 3] nel; 
k 
LPO p6e 1 
= bn2 Sy £- Sey ye E 
n<pk j=l nel; 


We have 


1 1 
bm & — = §mx (‘108 » +0 (=)) 
n p 
n<pk 


bmx log x + O(2). 


I 
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Moreover, 
1 1 6 1 6 p? 
ee 
met; n€(yj,e°m/?y5] 
and so 
k k 
1 6k 
SoSo=- = +0 ia 
n 2 x 
j=l nel; j=l 
6, (bm log x 
= — O(1 O(1 
m (SE + 011) +000) 
62 loga 
m O 
128c9 z ( ) 
since i 
se p(o® —1) _ 2p* 2 _ ou) 
ra (rer) 
Therefore, 


Sim 1 63 2logx 
os oy no Deb 
Combining these results, we obtain, for z > x3(m), 


63 x log x 


> [R(=)| < (dma log a + O(2)) ( oer +0(0)) 


62 
[5 cs PE al 
( sg) 6 x log x + O(a) 


and 


|R(2)| < ions > JR (=)| + o(@) 


62 
ea) een 
( sg) 6 x + o(x) 


We choose u»+1 sufficiently large that for all x > Um+1 we have 


62 
O(a) < Em (1 = smi | Om. 
Then 52 
|R(ax)| < (1+ €m) (1 _ | Om& = bm412. 


This completes the proof of the prime number theorem. 
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Exercises 
1. Let p, denote the nth prime number. Prove that p, ~ nlog n. 


2. Prove that 
< Pn+1 me 
lim —— =1. 


no Pn 


3. Let 6 > 0. Prove that 
O((1 4+ 6)a) — B(x) ~ 6x. 


This implies that there is a prime between x and (1+ 6) for all 
sufficiently large x. 
4. Prove that 
6x 


m((1 + 8x) — m(2) ~ SS. 


5. Prove that 
m(2x) — (a) ~ r(x). 


6. Let p, denote the nth prime number, so that pj = 2, po = 3,.... 
Prove that the asymptotic formula p, ~ nlogn implies the prime 
number theorem. 


7. Deduce Selberg’s formula (9.3) from the prime number theorem. 


8. Let 6; = 2. For every m > 1 define 


62 
ia = bm, Pe m . 
bm+1 = 6 ( ae 


Prove that 
0<ébm< 


= 
vm 
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For any positive integer n, the arithmetic functions w(n) and Q(n) are 
defined as follows: w(n) = k if n is divisible by exactly & different primes, 
and Q(n) = @ if n is the product of @ not necessarily distinct primes. If 
n = pi'---pr*, where pi,...,px are pairwise distinct prime numbers and 
@1,...,@,% are positive integers, then w(n) = k and Q(n) = a, +--+ + ay. 

Let 7(x) denote the number of positive integers n not exceeding x that 
can be written as the product of exactly k distinct primes, 


T(x) = Ss 1. 


n<« 
w(n)=Q(n)=k 
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Let 7{(z) denote the number of positive integers n not exceeding x that 
can be written as the product of exactly k not necessarily distinct prime 


numbers, 
1 (“) = a 1. 
n<a 
Q(n)=k 
Our goal is the following asymptotic estimate for the number of integers 
with exactly & prime divisors: 


‘ x(log log )*~4 
T(x) ~ TEL) ~ “(k—D!loga | 


This is a generalization of the prime number theorem, since 71(x) = m7} (x) = 
(x) ~ «/log a. 

Let P = {2,3,5,...} be the set of prime numbers, and let P* be the set 
of all ordered k-tuples of primes. Let r;(m) denote the number of represen- 
tations of n as an ordered product of & primes, that is, 


Since every positive integer is uniquely (up to order) a product of primes, 
we have 

O<rzp(n) <k! for alln > 1. 
Moreover, r,(n) = k! if and only if w(n) = Q(n) = k, and 0 < rz(n) < kl if 
and only if w(n) < O(n) =k. 


Theorem 9.6 Fork > 1, let 


n<u (Pipes ppePk 
Pi PRSe 
Then 
kimp(x) < I(x) < kiai(xr) « ox. 
For k > 2, 


Proof. We have 


I(x) = S- rp(n) < k! S- Ll=kinj(2) <kla <a 
n<x n<e@ 


= rp(n)>0 


and 


T(x) = So re(n) =k So 1 = km (2). 


n<x n<a 
= rp (n)=k! 
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Let k > 2. The function a{(x) — m,(#) counts the number of positive 
integers n < x that can be written as a product of k primes but not as 
a product of & distinct primes. Every such integer is of the form n = 
Pi-** Pk—2Dz_1- Therefore, 


mk(a) — a(x) < S- 1 


ale 


I 

oq 
Cos 
a 
— 

8 
wa 


This completes the proof. 


Theorem 9.7 Let So(x) =1. For k > 1, let 


CE as 
Pi PR Se 
Then 
Si.(x) ~ (log log x)" 
and 


ster 58 (5): 


Proof. By Theorem 8.7, 


1 
Si(a2) = S- a log log x 


pSu 
and so 
Si(a/*) ~ log log a'/* ~ log log x 
for all k > 1. Since 
k 


(a(e))" = (yt}= > — 


IA 
M 
- 
| 
# 
& 


x 
M 
le 
II 
me 
S 
= 
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it follows that 
Si,(x) ~ (log log x)*. 


Also, 


1 
S.(x) = —— 
Kl ) ms P1***Pk-1Pk 


(Piss Pk—1:PK)EP* 
PL PR-1PkS& 


1 
= = » Pras 


Pro (P15. PR—1)€Pk-1 
Pl PR—-1S@/PRE 


This completes the proof. 


Theorem 9.8 For k > 1, let 


Then 
O(a) ~ ka(log log a)*~!, 


Proof. For j = 1,...,4+1, let 


k+1 
Pic: *'Pr+i = I] Di- 


5 


So log pi +++ Py Det = log(p1- -Prti)® = klog pi -- 


and so, by Exercise 4, 


k0e¢1(@) = = klog pi---Pk+1 


(pqs PRt1)EPRtl 
Pl -PR41S2 


k+1 


= S> +s log pi: 


(Piso Pry EPRHT G=1 
Pl PR41S% 


*Pk+1; 


*Pk+1 
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= 5 (k +1) log pi --- pe 
(pqs PR41)EPRt1 
PL -PR41S2 


= (k+1) > Se log pi +++ Dk 


Pkoi SX (py4---sPp)EP® 
PL PRS&/PR41 


= +1) Da (2). 


For k > 1, let 7 
F(z) = 04(x) — kaSy_1(2). 
Then 
kFegi(z) = k0p4i(e) — k(k + 1)xS;(2) 
= the nym (=) —k(k+ Dy (=) 
- wn (a(8) #8) 
- ary Das). 


We shall prove by induction that 
Fi,(x) = 0 (ax(log log g)P-*) : (9.13) 


For k = 1, 
F(a) = V(x) — « = o(a) 


is the prime number theorem. Suppose that (9.13) is true for some k& > 1. 
Let ¢ > 0. There exists xo(e€) such that 


|Fi.(a)| < ex(log log a)*~! 
for all x > xp = Xo(e), and so 
1 
S- Fr (=) < ex(log log x)*~1 SS — < 2ea(log log x)" 
Pp 
pSa/ro pa/xo 


for « > 2, = a1(€) > 2. Since the functions 0,(a) and S,-1(a) are 
nonnegative and increasing for x > 1, it follows that Fy,(a) is bounded on 
any finite interval, and so there exists a constant M, = M,(e) such that 


|Py(a)| < My for l<a< 2}. 
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Therefore, 
x 
kFen(e) = (+) DA (=) 
pSa P 
= @+) DA(Z)+e+0 DY a(%) 
psa/xo P x/xo<p<x y 


2(k + 1)ea(log log x)* + (k +1)Mi1(2) 
Akex(log log x)* + 2kM,z. 


IN IA 


Dividing by k, we obtain 
Fyii(x) & ex(log log x)* 
for all sufficiently large x. This proves (9.13). It follows that 


O.(xz) = kaSp_1(x) + F(x) 
kx(log log x)*~! + 0 (x(log log x)*~") 
Roa 


I 


2 


ka (log log x) 


This completes the proof. 


Theorem 9.9 Fork > 1, 
7 x(log log x)*~4 


Proof. This follows from Theorem 9.8 by partial summation. We have 


0, (2) = SS log p1-+* Pr = r.(n) logn, 


(p15--.PR)EP* n<ax 
PL PREZ 


and, by Theorem 9.6, the arithmetic function r;,(m) has mean value 


T(x) = So re(n) = O(2). 


n<x 
Then 
” TLi (t)dt 
Ox(e) = Th(a)loge— f° SOE 
1 
= I{(«)logr+O(z). 


It follows that 


0 ka (log | net 
nate) = 22 4 0-22) ~ Meewtowe 
og x og x 
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For k > 2, 
p-1(a) = 0 (IT, (x) - 


By Theorem 9.6, 
Ili (a) < klak(a) < klar, (a) + KIOG_y (x) < W(x) + KITZ_1 (2), 


and so 
k(x)  a(loglogx)*~1 


kl (k—1 Noga | 


m(@) ~ Tr(a) ~ 


This completes the proof. 


Exercises 
1. For every positive integer n, let r,(m) denote the number of k-tuples 
of prime numbers (p,..., px) such that n = p,--- pz. Compute r3(n) 
for n < 50. 


2. Compute r4(n) for n < 100. 


3. Let 0 > 1. Prove that 


ed DOE 


pEP 


4. Prove that 


k+1 


> So log pi +++ By ++ Prva 


(Pi PReEPKTT G=1 
Pl PR41S@ 


= ys (k + 1) log pi +++ De. 


5. Let x, be the smallest number such that 7,(a2,) > 0. Prove that for 
every € > 0 there exists an integer ko = ko(e) such that if k > ko, 
then 


RG-e)k <a < Rater. 
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9.5 Notes 


In a lecture delivered to the Mathematical Society of Copenhagen in 1921, 
Hardy said, 


No elementary proof of the prime number theorem is known, 
and one may ask whether it is reasonable to expect one. Now 
we know that the theorem is roughly equivalent to a theorem 
about an analytic function, the theorem that Riemann’s zeta 
function has no roots on a certain line. A proof of such a theo- 
rem, not fundamentally dependent upon the ideas of the theory 
of functions, seems to me extraordinarily unlikely. It is rash to 
assert that a mathematical theorem cannot be proved in a par- 
ticular way; but one thing seems quite clear. We have certain 
views about the logic of the theory; we think that some theo- 
rems, as we say “lie deep” and others nearer to the surface. If 
anyone produces an elementary proof of the prime number the- 
orem, he will show that these views are wrong, that the subject 
does not hang together in the way we have supposed, and that 
it is time for the books to be cast aside and for the theory to 
be rewritten. 


G. H. Hardy, quoted in Bohr [11] 


In 1949, in a review of the Erdds and Selberg elementary proofs of the 
prime number theorem, Ingham wrote, 


What Selberg and Erdés do is to deduce the PNT directly 
... without the explicit intervention of the analytical fact .... 
How far the practical effects of this revolution of ideas will pen- 
etrate into the structure of the subject, and how much of the 
theory will ultimately have to be rewritten, it is too early to 
say. 


A. E. Ingham [71] 


The prime number theorem was proved independently in 1896 by J. 
Hadamard [46] and C.-J. de la Vallée Poussin[23]. Their proofs applied 
complex function theory to the Riemann zeta function. Ingham’s classic 
monograph, The Distribution of Prime Numbers [70], published in 1932, 
contains an analytic proof of the prime number theorem. 

The elementary proof of the prime number theorem was discovered in 
1948 at the Institute for Advanced Study in Princeton. In March 1948, 
Selberg discovered his famous formula (Theorems 9.2 and 9.3) and gave an 
elementary proof of Dirichlet’s theorem on primes in arithmetic progres- 
sions (Theorem 10.9). By April 1948, he knew that A+ a = 2 (Exercises 4 
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and 5 in Section 9.2), and that the prime number theorem is equivalent 
to A= a= 1. Ina letter to H. Weyl that is dated September 16, 1948, 
Selberg! wrote: 


In May I wrote down a sketch to the paper on Dirichlet’s theo- 
rem, during June I did nothing except preparations to the trip 
to Canada. Then around the beginning of July, Turan asked me 
if I could give him my notes on the Dirichlet theorem so he could 
see it, he was going away soon, and probably would have left 
when I returned from Canada. I not only agreed to do this, but 
as I felt very much attached to Turan I spent some days going 
through the proof with him. In this connection I mentioned the 
fundamental theorem to him.... However, I did not tell him the 
proof of the formula, nor about the consequences it might have 
and my ideas in this connection.... I then left for Canada and 
returned after 9 days just as Turdn was leaving. It turned out 
that Turan had given a seminar on my proof of the Dirichlet 
theorem where Erdés, Chowla, and Straus had been present, I 
had of course no objection to this, since it concerned something 
that was already finished from my side, though it was not pub- 
lished. In connection with this Turan had also mentioned, at 
least to Erdés, the fundamental formula. ... 


In a letter to D. Goldfeld that is dated January 6, 1988, Selberg wrote: 


July 14, 1948 was a Wednesday, and on Thursday, July 15 I met 
Erdés and heard that he was trying to prove pn41/pn — 1.... 
Friday evening or it may have been Saturday morning, Erdés 
had his proof ready and told me about it. Sunday afternoon 
(July 18) I used his result (which was stronger than pp+1/Pn 
1, he had proved that between x and x(1 + 6)there are more 
that c(6)x/log x primes for x > x0(6), the weaker result would 
not have been sufficient for me) to get my first proof of PNT. 
I told Erdés about it the next morning (Monday, July 19). He 
then suggested that we should talk about it that evening in the 
seminar room in Fuld Hall.... 


Erdos records the history of the first elementary proof of the prime number 
theorem in the same way: 


In the course of several important researches in elementary 
number theory A. Selberg proved some months ago the follow- 


1This and the following extract from Selberg’s correspondence appear in Goldfeld’s 
historical note [38] 
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ing important asymptotic formula: 


S| (log p)? + oe log plog q = 2x log x + O(z), (9.14) 


psa pqcx 


where p and q run over the primes.... 


Using (9.14) I proved that pn+i/pn — 1 as n — oo. In fact I 
proved the following slightly stronger result: To every c there 
exists a positive 6(c), so that for x sufficiently large we have 


m[a(1+c)] — r(x) > 6(c)x/log ax (9.15) 


where (2) is the number of primes not exceeding z. 


I communicated this proof of (9.15) to Selberg, who, two days 
later, using (9.14), (9.15) and the ideas of the proof of (9.15), 
deduced the prime number theorem.... 


Erdés [34, pp. 374-375] 


Both Erdés [34] and Selberg [128] subsequently gave independent ele- 
mentary proofs of the prime number theorem. These proofs all use Sel- 
berg’s original formula, as well as ideas that Erdés introduced in his proof 
of (9.15). 

Number theorists of Hardy’s and Ingham’s generation believed that there 
could be no elementary proof of the prime number theorem. They were also 
convinced that if, by some miracle, an elementary proof were discovered, 
then the ideas in that proof would lead to tremendous progress in our 
knowledge of the distribution of prime numbers and the zeros of the zeta 
function. Both statements are false. Erddés and Selberg produced elementary 
proofs, but their beautiful method has not led to any extraordinary new 
discoveries in number theory or analysis. 


The elementary proof has so far not produced the exciting in- 
novations in number theory that many of us expected to follow. 
So, what we witnessed in 1948, may in the course of time prove 
to have been a brilliant but somewhat incidental achievement 
without the historic significance it then appeared to have. My 
own inclination is to believe that it was the beginning of impor- 
tant new ideas not yet fully understood and that its importance 
will grow over the years. 


E. G. Straus [136] 


The elementary proof of the prime number theorem that appears in this 
chapter is the proof in Selberg’s original paper [128]. Postnikov and Ro- 
manov [115, 116] give a similar elementary proof in terms of the Mobius 
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function. Daboussi [18] and Hildebrand [67] obtained elementary proofs of 
the prime number theorem that do not depend on Selberg’s formula. Di- 
amond [24] has written a careful survey of elementary methods in prime 
number theory. 

For more recent developments in prime number theory, see Tenenbaum 
and Mendés-France, The Prime Numbers and Their Distribution [140]. D. 
J. Newman has recently published a simple analytic proof (Newman [112], 
Zagier [159]). 

The asymptotic formula for the number of integers with exactly k prime 
factors is based on work of E. M. Wright (see Hardy and Wright [60, 
pp. 368-370]). 

The most important unsolved problem in mathematics is the Riemann 
hypothesis. It can be expressed in terms of the distribution of prime num- 
bers. By Exercise 2 in Section 9.2, the logarithmic integral li(a) is asymp- 
totic to «/log x, and so the prime number theorem can be restated in the 


form 
a(x) ~ li(z). 


The Riemann hypothesis is an assertion about the size of the error term in 
the prime number theorem, namely, that 


a(x) = li(x) +O (ais) 


for every € > 0. 


10 


Primes in Arithmetic Progressions 


10.1. Dirichlet Characters 


Dirichlet’s theorem states that ifm > 1 and a are relatively prime integers, 
then the arithmetic progression mk + a contains infinitely many primes, 
that is, there exist infinitely many primes p of the form p = mk +a. 
Equivalently, the congruence class a (mod m) contains infinitely many 
prime numbers. For example, there are infinitely many primes p such that 
p= 3 (mod 4), and there are infinitely many primes p such that p = 5 
(mod 6), by Exercises 8 and 9 in Section 1.5. 

We begin by constructing a class of periodic functions called Dirichlet 
characters whose domain is the set of integers. 

Let m be a positive integer and let Z/mZ be the ring of congruence 
classes modulo m. The additive group of this ring is cyclic of order m, and 
its dual group is also cyclic of order m. A character of the additive group 
Z/mZ is called an additive character modulo m. 

Let ¢ be a primitive mth root of unity. If w is an additive character 
modulo m, then there exists a unique integer a € {0,1,2,...,m— 1} such 
that 


blk +mZ) = ¢%, 
Choosing the primitive mth root of unity ¢ = exp(2mi/m), we have 


2riak 


Wa(k + mZ) = exp ( ) = €m(ak). 
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Associated to the additive character qq is a complex-valued function w, on 
the integers that is defined by 


Wh (k) = valk + mZ). 


We let w_ denote both the additive character modulo m and its associated 
function on the integers. 

The group of units in the ring of integers modulo m is the multiplicative 
group (Z/mZ)* of order y(m), where y(m) is the Euler y-function. A 
character of this group is called a multiplicative character modulo m. The 
principal character x9 modulo m is the multiplicative character defined by 
xo(a + mZ) = 1 for all a+ mZ € (Z/mZ)*. 

For every multiplicative character y, we have 


x(-1+mZ)? = x(1+ mZ) = 1, 


and so 


x(-1+ mZ) = +1. 


The character y is called even if x(—1+mZ) =1 and oddif y(—1+mZ) = 
—1. 

A multiplicative character modulo m is called real if it is real-valued. 
Since the only real roots of unity are +1, it follows that if x is a real 
character, then x(a + m/Z) = £1 for all (a,m) = 1. The character y is 
called complex if y(a+mZ) is not real for some congruence class a+ mZ. 

Let x be a multiplicative character modulo m. We extend y to the 
nonunits of the ring Z/mZ by setting y(a + mZ) = 0 whenever (a,m) = 1. 


For every odd prime p, the Legendre symbol : defines a real multi- 
plicative character y modulo p by 


1 if ais a quadratic residue modulo p, 
x(a + pZ) = (=) = -—1 if ais a quadratic nonresidue modulo p, 
0 if (a,p)>1. 


By Theorem 3.14, this character is even if p= 1 (mod 4) and odd if p=3 
(mod 4). 

Corresponding to every multiplicative character y modulo m there is a 
complex-valued function x’ on the integers defined by 


x'(a) = x(a+ m2). 


The function y’ : Z — C is called a Dirichlet character modulo m. 
A Dirichlet character x’ modulo m has the following properties: 


(i) The function x’ has period m, that is, if @ = 6 (mod ™m), then 
x'(a) = x'(b). 
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(ii) The support of x’ is the set of integers relatively prime to m, that is, 
x(a) £ 0 if and only if (a,m) = 1. 


(iii) x’ is completely multiplicative, that is, y/(ab) = y‘(a)x‘(b) for all 
integers a and b. 


Conversely, every complex-valued function x’ on the integers that satisfies 
properties (i), (ii), and (iii) is a Dirichlet character modulo m, and the 
multiplicative character y modulo m that corresponds to x’ is defined by 


x(a+ mB) = y'(a). 


From now on, we shall use y to denote both a multiplicative character 
modulo m and the corresponding Dirichlet character modulo m. 

The principal Dirichlet character yo modulo m is defined by yo(a) = 1 if 
(a,m) = 1 and yo(a) = Oif (a,m) > 2. In particular, the principal Dirichlet 
character modulo 1 satisfies xo(a) = 1 for all integers a. 

A Dirichlet character modulo m is called real, complex, even, or odd 
precisely when the corresponding multiplicative character modulo m is real, 
complex, even, or odd, respectively. 

We can state the orthogonality relations for Dirichlet characters as fol- 
lows. 


Theorem 10.1 (Orthogonality relations) Let }), (nod m) denote the 
sum over a complete set of residue classes modulo m, and let D9 (Gaddiae 
denote the sum over the y(m) Dirichlet characters modulo m. If x is a 


Dirichlet character modulo m, then 
a)= ; 
Oe aa 


If a is an integer, then 


> @= y(m) ifa=1 (modm), 
us 0 ifa#1 (mod m). 
x (mod m) 
Proof. This is simply Theorem 4.6 applied to the multiplicative group 
(Z/mZ)*. 


Theorem 10.2 (Orthogonality relations) Let )7, Gnod m) denote the 
sum over a complete set of residue classes modulo m, and let Doe Genoa any 
denote the sum over the y(m) Dirichlet characters modulo m. If x1 and x2 


are Dirichlet characters modulo m, then 


eee 6 ae 


a (mod m) 
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If a and b are integers, then 


0 otherwise. 


> x(a)x(b) = { v(m) if (a,m) =(b,m) =1landa=b (mod m), 


x (mod m) 


Proof. This is Theorem 4.7 applied to the multiplicative group (Z/mZ)*. 


Let d and m be positive integers such that d divides m. There is a natural 
ring homomorphism 


nt: Z/mZ— Z/dZ 


defined by 
m(a+mZ) =a+dZ. 


If (a,m) = 1, then (a,d) = 1 and so zm induces a group homomorphism 
mw: (Z/mZ)* — (Z/dZ)* on the unit groups of these rings. Let A be a 
multiplicative character modulo d. The composition of the maps 


(Z/mZ)* = (Z/dZ)* 2+ c* 
induces a multiplicative character y modulo m defined by 
X= An, 


and so 
x(a + mZ) = (a+ dZ). 


This character is called an induced character. A character y modulo m is 
called a primitive character if it is not induced from a character modulo d 
for any proper divisor d of m. 

Alternatively, we can define induced characters by means of Dirichlet 
characters modulo m. Let d and m be positive integers such that d divides 
m. If is a Dirichlet character modulo d, then we can define a Dirichlet 
character x modulo m by the formula 


Na) if (a,m) = 1, 
x(a) = { 0 if (a,m) A 1. 


Let d,k, and m be positive integers such that d divides k and k divides 
m, and let 4,0, and x be Dirichlet (or multiplicative) characters modulo 
d,k, and m, respectively. If the character \ induces o and the character o 
induces y, then A induces x. 

There is a unique Dirichlet character modulo 1; it is the constant function 
A(a) = 1 for all integers a. For every m > 2, the character A induces the 
principal character yo modulo m. 
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Exercises 


a 
2. 


10. 


Construct all of the Dirichlet characters modulo 5. 


Prove that the nontrivial Dirichlet character modulo 6 is induced by 
a primitive Dirichlet character modulo 3. 


. Construct all Dirichlet characters modulo 4 and modulo 8. Find the 


primitive characters. 


. Let m and d be positive integers such that d divides m. Let be 


a Dirichlet character modulo d, and let x be the Dirichlet character 
modulo m induced by X. Prove that y(a) = A(a)xo(a), where yo is 
the principal character modulo m. 


. Let x be the principal Dirichlet character modulo m. Prove that 


b 


2 x(n) > pa" y(m) 


for all integers a and b. 


. Let x be a nonprincipal Dirichlet character modulo m. Prove that 


b 


S" x(n) < y(m) 


n=a 


for all integers a and b. 


. Prove that for every integer a, 


_ f v(m) ifa=1 (modm), 
Exo ={ 0 ifa#1 (mod Mm), 
x 
where the summation runs over all of the Dirichlet characters modulo 
m. 


. Let y*(m) denote the number of primitive characters modulo m. 


Prove that 


g(m) = >o¢*(d), 


d\|m 


where v(m) is the Euler phi function. 


. Prove that y*(m) is a multiplicative function and that 


Prove that 
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10.2 Dirichlet L-Functions 


We begin by introducing a class of functions that are analytic on half 
planes of the complex plane. The proof of Dirichlet’s theorem, however, 
involves only routine partial summations of the infinite series and infinite 
product representations of these functions on the positive real axis. We do 
not use complex function theory, and, indeed, it would suffice to consider 
the L-functions only for o > 0. 

Let x be a Dirichlet character modulo m. The Dirichlet L-function as- 
sociated with x is the function 


where 
s=artit 
is a complex number with real part #(s) = o and imaginary part S(s) = t. 
For example, if yo is the principal character modulo 3, then 
1 1 1 1 1 


L aI av 
(s, Xo) Fg Wp tee Tiger’ ge 


1 
L(s,x3) =1-= + | dies 


We shall prove that if x9 is the principal character modulo m, then 
L(s,xo) is analytic in the half-plane o > 1, and if x is a nonprincipal 
character modulo m, then L(s, x) is analytic in the half-plane o > 0 and, 
moreover, L(1,y) 4 0. We shall see that this implies Dirichlet’s theorem 
on primes in arithmetic progressions. 


Theorem 10.3 Let y be a Dirichlet character modulo m, and let s be a 
complez number with R(s) = 0 > 1. The function L(s,x) is analytic and 
has the Euler product 


Lox) =[] (1X2) 


Moreover, L(s,x) #0 and 


log L(s, x) =~ 
Pp 


(10.1) 
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Proof. Since 


and 


converges for ao > 1, it follows that the series L(s,) converges uniformly 
and absolutely in the half-plane o > 1+ 6 for every 6 > 0. Similarly, for ev- 
ery prime p, the series 07° 9 x(p")p—** converges uniformly and absolutely 
in the half-plane o > 1, and 


s vm > (1 oT 


Ss 
k=0 P 


Since the character y is completely multiplicative, the Fundamental Theo- 
rem of Arithmetic implies that 


Il (> a _ x(n) 
D s ns 2 


p<« \k=0 neN (a) 


where N(x) denotes the set of all positive integers n divisible only by primes 
p< .«.In particular, ifn < x and p divides n, then p < x, andson € N(z). 
For every € > 0 there exists a number xo(¢) such that, if « > ao(e), then 


va<e 


n>x 


It follows that for x > ao(€) we have 


1(s,x) -T] (1-2) = oo > xo) 


s 
pax n=1 neN (x) 
x(n) 
= De 
= s 
n>x oe 
1 
< a= 
ES ae 
n>x 
ey 


and so the infinite product converges to the L-function, that is, 


vod T0-) 


Pp 


This product is called the Euler product of L(s, x). 
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We shall prove directly that L(s,y) is nonzero for ¢ > 1. Each factor of 
the Euler product is nonzero, since 

1 

ee <5 


gl 
p ra 


and so it suffices to prove that 


-1 
p>Xo 
for some number zo. The inequality 


<< — << 
oe ks = Se ko o _ o 
k=1 P k=1 P - 1 


implies that 


IV 
— 
| 


Choose 29 such that 


2 1 
—<.. 


It follows that for x > x9 we have 


Ley) - a 


Lo<pSax 


(ey 


IV IV 
= 

= 
Me 
= 

. | 
A, Slr 
Ss 


and so 
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Therefore, 
-1 
L(s,x) =|] (1 = x) #0. 
Pp 


For |z| < 1, the principal value of the logarithm has the power series 


1 =e 
log > = log(1 n= : 


log L(s,y) = los] (1-2) © 


since 


p n=2 


¥ 
‘ 


< 1. 


This completes the proof. 


For example, if yo and x3 are the principal and nonprincipal characters 
modulo 3, respectively, then 


L(s, x0) = II (1 a es ae 


p23 
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and 
Hex)= [TL @-797 JT] G+)". 
p=1 (mod 3) p=2 (mod 3) 


Theorem 10.4 Let x be a nonprincipal character modulo m. The Dirichlet 
L-function L(s,x) is analytic in the half-plane o > 0. Let K be a compact 
set in the half-planeo > 0. Fors € K andx >1, 


L(s,x) = >> x) +0 (a), (10.2) 


nN<ax 
where the implied constant depends on m and K. 


Proof. To prove that L(s, x) is analytic in o > 0, it suffices to prove that 
the series }°°°_, x(n)n~* converges uniformly on every compact subset of 
the right half-plane o > 0. 

Let K be a compact subset of the right half-plane. There exist positive 
constants 6 and A such that o > 6 and |s| < A for every s=o+ite€ K. 
We use partial summation (Theorem 6.8) with 


f(n) = x(n) 
and ; 
g(t) = Be 
By Exercise 6 in Section 10.1, F(t) = >>,,<, x(n) < 1 and 
tm) = F(n)g(n) 
ns 
N<n<M N<n<M 
M 
= F(M)g(M) — F(N)9(N) - i F(t)g' (tat 
M 
= a ae +s | FO) 
Ms Ns tstl 
< tpthif Zz t 
Me Ne $ N totl 
2.3 |s| 
N? oN? 
« (1+ cay ge 
6) N® 
1 
ry 


where the implied constants depend on the modulus m and the compact 
set AK. It follows that the partial sums of the series L(s,\y) are uniformly 
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Cauchy on K, and so L(s,y) converges uniformly on K and is analytic in 
the right half-plane. 
Since 


x(n) 7 1 

ye a Sys 

N<n<M 
for all M > N, it follows that 
N le) 
x(n) x(n) 1 
LGQ) =) =O Se 

n=1 n=N 


n nr 


This completes the proof. 


The analytic nature of Dirichlet [-functions is different for principal 
and nonprincipal characters. In the special case where xo is the principal 
character modulo 1, we have xo(n) = 1 for all integers n, and the Dirichlet 
L-function L(s,xo9) for ¢ > 1 is the Riemann zeta function 


love) 1 1 -1 
= —= 1- — . 
¢(s) Do II ( =) 
n= Pp 
Theorem 10.5 Let xo be the principal character modulo m. For a > 1, 
1 

1(s,x0) = 8) J] (1-5) 
plm 


and 


lim (7 — 1)L(o, xo) = [J (1 — | ' 


o-lit 
p|m 


Forl<a< 2, 
1 


T- 


Proof. The Riemann zeta function is not analytic at s = 1, since for 
o >landn> 1 we have 


n+l dy 1 "dx 
mgt ae oe an 
n x n n—-1 ¥ 


and so 


Therefore, 
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and 
lim (o — 1)¢(o) = 1. (10.3) 


If l1<o< 2, then 


1 1 1 
log | ——— } < log ¢(o) < log | ——— ] + logo < log | —— } + log2, 
a-—l ao—1 1 


= o— 


and so 


eee ie ( : :) +0(1). (10.4) 


o— 


If yo is the principal character modulo m, then 


bexe) = TT (- 2) 


Let 1 <o < 2. Then 


(o ~1)(0,x0) = (= 1)6(0) J] (1-5), 


p|m 


and (10.3) implies that 
lim 1)L = | | 1 : 
im, (o — )L(o, xo) eae 


Moreover, 


log L(a,x0) = log¢(c) +log]] (1 - ) 


by (10.4). 
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Exercises 


1. 
2. 


Compute the four Dirichlet L-functions modulo 8. 


A Dirichlet series is a function of the form 
Sa 
n 
n=1 
where {a,,}°2, is a sequence of complex numbers. Prove that if a, = 
O(n), then the series F'(s) converges in the half-plane ¢ > 1+a and 
uniformly in the half-plane o > 1+ a+ 6 for every 6 > 0. 


. A Dirichlet polynomial is a function of the form 


where {a,}_, is a finite sequence of complex numbers. Find the 
zeros of the Dirichlet polynomial 


. Let xo be the principal character modulo 3, and let 3 be the non- 


principal character modulo 3. Prove that 


= 1 
L(s,xo0) + L(s, x3) = 2 >. 38 


n= 
n=1 (mod 3) 


. Let m > 4, and let G be the group of Dirichlet characters modulo m. 


Prove that 


SUsxy= elm) Ss. 


~ 1 
XEG n=1 (mod m) 


. Let k and n be positive integers such that k divides n, let y* be a 


Dirichlet character modulo k, and let x be the Dirichlet character 
modulo m induced by x*. Prove that 


(3.x) = B(s.x°) T] (1-2), 


p|m 


. Let 


fa=>) es, (10.5) 


Prove that 
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(a) f(s) is analytic in the half-plane o > 0, 
(b) 0< f(a) <1 fora >0. 


8. Let 
g(s) =1-2'—, (10.6) 

Prove that 

(a) g(s) is analytic in the entire complex plane. 

(b) g(s) = 0 if and only if s = 1 — 2mik/log2 for k € Z. 

(c) g/(1 — 2mtk/ log 2) = log 2. 

(d) g(o) <0 for0<oa <1. 

(e) (1 a a is meromorphic in the complex plane except for 


simple poles at s = 1 — 2mik/log2 with residues 1/ log 2. 


9. Define the functions f(s) and g(s) by (10.5) and (10.6), respectively. 
Prove that for 0 > 1, 


or, equivalently, 


¢(s) = (1—-2?-*) pee 


Show that the right side of this equation is meromorphic in the half- 
plane o > 0. This determines the meromorphic continuation of the 
Riemann zeta function to the half-plane o > 0. Prove that 


C(a) <0 for0O<o0 <1. 
Use (10.3) to prove that 


> a = log2. 


n=1 


10.3. Primes Modulo 4 


In this section we show that there are infinitely many primes p such that p = 
1 (mod 4), and also infinitely many primes p such that p=3 (mod 4). 
This is Dirichlet’s theorem for modulus 4. The proof is easier than the 
general case, and shows clearly the use of Dirichlet characters and Dirichlet 
L-functions. 
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There are two Dirichlet characters modulo 4. Let xo be the principal 
Dirichlet character. Then 


i= 1 if n is odd, 
xo ~ | 0. if n is even. 


The L-function L(s, yo) converges in the half-plane o > 1, where 


1 Dir oA 
L(sx0) = DG =l+-+54+54+-- 


but the infinite series 


1 1 1 
LO H=lplpe pete 
(1, xo) tatetyt 
diverges. 
Let x4 be the nonprincipal character modulo 4. Then 


1 ifm=1 (mod 4), 


ya(n) = ¢ —-1 ifn=3 (mod 4), 
0 ifn is even. 


The L-function L(s, yo) converges in the half-plane o > 0, where 


L(s,x4) = — eal + bee 


II (+5). 


p=3 (mod 4) P 


ll 
7 
fe} 
o 
& 
a 
Re 
| 
| 
Ne. 
L 


The infinite series 


1 1 1 
Da) Sate Se 
(1, x4) Srp he 


converges, and L(1, 4) > 0. Indeed, 


ae (: +5 7) +(j a)te 


> 0.744 
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ro = (5-2) -(E-9 


< 0.835. 


and 


(Using the power series expansion of the inverse tangent, one can prove 
Leibniz’s formula L(1, x4) = 7/4 = 0.785....) 


Theorem 10.6 For1<o < 2, 


and 


1 1 1 


o 
p=3 (mod 4) P 


In particular, there exist infinitely many primes p= 1 (mod 4) and in- 
finitely many primes p=3 (mod 4). 


Proof. Since L(s, x4) is continuous for o > 0, it follows that 
log L(a, x4) = O(1) for l<o <2. 


Let 1 < o < 2. By (10.1) of Theorem 10.3, we have 


log L(o, xo) = > = +0(1) 


p23 
and (p—1)/2 
_1)(@- 
log L(a, x4) = Se a le + O(1) 
p23 fe 
Therefore, 
1 1 
arg Vlog L(o, x0) + log L(a, x4)) + OU) 
p=1 (mod 4) P 


l| 


1 
3 log L(a, xo) + O(1) 


+O(1), 


l 


Fi 
De aay 


by Theorem 10.5. Since 
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it follows that there exist infinitely many primes congruent to 1 modulo 4. 


Similarly, 
1 1 1 


o 
p=3 (mod 4) P 


and there exist infinitely many primes congruent to 3 modulo 4. 


Exercises 


1. Let yo be the principal Dirichlet character modulo 6, and let xg be 
the nonprincipal Dirichlet character modulo 6. Prove that 


S- zt = 1 (log L(a, xo) + log L(o, x6)) + O(1) 
D 2 


and 


or = 5 (los L(0, xo) ~ log L(o, xe)) + O(1). 


2. Prove that there exist infinitely many primes p = 1 (mod 6) and 
infinitely many primes p=5 (mod 6). 


3. Compute L(1, x6) with an error of at most 0.01. 


10.4 The Nonvanishing of L(1, v) 
In this section we prove that L(1,,) 4 0 for every nonprincipal character 
xX: 
Lemma 10.1 Let xo be the principal character modulo m. Then 
A 
n<ux 5: 
Proof. Observe that 


A(n A(p* 
te on 


n<« pilm pk<« 
k>1 


ey 
eee 


p|m k=1 
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= O(1). 


n<x n<a 
(n,m)=1 
yn al S A(n) 
_ no n 
n<a nex 
(njm)>1 
= logr+O(1) 


This completes the proof. 


Lemma 10.2 Let x be a nonprincipal character modulo m. If L(1, x) #0, 
then 
A 
n<a ve 


Proof. Recall that F(t) = }>,<, x(k) « 1 (Exercise 6 in Section 10.1). 
By partial summation, we have ~ 


3 x(k)logk F(x) loga ie F(t)(1 — log t) a 


x t? 


k<ax 


1 1+ log t 
< a oe dt 
1 


x t? 
<K 1. 
By Theorem 10.4, we have 
d 
So eo): 
d x 
d<a/n 
Using the identity logk = >7,,,, A(m), we obtain 


k)logk 
y Wiest _ Se S>A(n) 


k<a k<au nk 


x(nd) A(n) 
os nd 


nd<x 
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n<a ae 
- > x(n)A(n) (20,9 +0(2)) 
= Has S- x(n)A(n) . 3 x(n)A(n) (") 
= LS a : oO ) 


since 


by Chebyshev’s theorem (Theorem 8.2). Therefore, 


L(1,x) >> _ = O(1). 


n<ux 


If L(1,x) 4 0, then 


This completes the proof. 


Lemma 10.3 Let x be a nonprincipal character modulo m. If L(1, x) = 0, 


then 
Se a = —logr+O(1). 


n<ax 


Proof. Since 


d\n 


we have 


57 x =~ 2 H(A ) log d. 


n<a n<ux 


From the identity 


loge = > S> u(d) ) log x, 


n<au d|n 
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we have 
loge + )> xe) 
>> 1) (a) log 
2 5 xa 1 
= » se) ise : » xe 
-> OHO jog (a. 40 (<)) 
= ied » OD og = 4 Le (<) wee log” 
= uae On) log 5 erik 
ie 
since 


d\ x(d)u(d), oa x 
yo(<) 7 bbe <i, be et 


d<ax 


by Theorem 6.4. If L(1, x) = 0, then 


S- xn) = —logr+O(1). 


nN<ux 


This completes the proof. 


Theorem 10.7 Let y be a complex character modulo m. Then L(1, x) 4 0. 


Proof. Let N denote the number of nonprincipal characters modulo 
m such that L(1,y) = 0. We shall prove that N = 0 or 1. By Lem- 
mas 10.1, 10.2, and 10.3, and the orthogonality relations for Dirichlet char- 
acters (Theorem 10.1), we have 


om y= yA YS ow 


n<a x (mod m) 


= Ss S- x(n)A(n) 


x (mod m)n<a 


n=1 (mod m) 
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= yl, yy xe 


n<a xXA#AX0 Nae 
loga — Nlogx + O(1) 
(1 — N) loga + O(1). 


Since A(n)/n > 0 for all n > 1, it follows that both sides of this equation 
are nonnegative for large x, and so N < 1. Therefore, L(1,y) = 0 for at 
most one nonprincipal character y. 

If x is a complex character modulo m, then ¥ is also a complex character 
and y # xX. We have 


Tid =X = M9) «10,9, 


and so L(1, x) = 0 if and only if L(1,x) = 0. Since N < 1, we must have 
L(1,x) #0 for every complex character x. This completes the proof. 


Theorem 10.8 Let x be a real nonprincipal character modulo m. Then 


L(1,x) #0. 


Proof. Since the character x is real, we have y(n) = +1 for every integer 
n. Consequently, for every prime power p’, 


xt’) =1+x(p)+x(p)? +--+ + x(p)" > 0 


and 


SS" x(p’) >1 ifr is even. 


j=0 


The character x is multiplicative, and so the convolution 


t(k) =1* x(k) = }> x(@) 


dk 
is also a multiplicative function. It follows that 
= [[ t@) = [] Sox’) = 
pr ||k p"||k j=0 


and 
t(k)>1  ifk=m? isa square. 
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Using the asymptotic formula in Theorem 6.9 for the partial sums of the 
harmonic series, we obtain for large x the lower bound 


re) = ey 


k<a m2<a 
~ m 2 
m<xi/2 


Applying the L-function estimate (10.2) in Theorem 10.4 with s = 1 and 
s = 1/2, we have 


yA = 1,9) +0 (4) 


n<ux 


and 


> oF = L(1/2,x) +0 (a). 


n<ux 


Let « > 1 and y = x'/?. By Exercise 6, the set of all lattice points (n, d) 
such that n and d are positive and nd < «x can be partitioned into two 
disjoint sets as follows: The first set consists of all lattice points (n, d) such 
that 1 <n < #'/? and 1 <d< 2/n, and the second set consists of all lattice 
points (n,d) such that 1 <d<a!/? and a/? <n<2a/d. Ifd=x2'/?, then 
a/d = x'/? and there is no integer n such that «!/? <n < a/d. Therefore, 
the second set can also be described as the set of all lattice points (n, d) 
such that 1<d<2'/? and a'/? <n< a/d. We have 


T(x) = Se 
k<au 


k<a n|k 
= S- x(m) 
nd<x (nd)*/ 
_ x(n) 
a 2 ei ee Ne, ys (nd)1/2 
n<al/2 aa d<al/2 gl/2<n<a/d 
_ x(n i x(n) 
- >> x oe aw yy di/2 a n/2° 
n<al/2 d<a/n d<a1/2 wl/2<n<a/d 


We shall estimate these sums separately. By Exercise 7, 


1 
i; = 22? +0 (a-¥?). 
d<a 
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The first sum is 


x(n) 1 
x ni/2 2 di/2 
n<al/2 d<a/n 
21/2 ni/2 
( ni/2 =e @a) 


- 4 
fecatia: 
oep/2 = _ _ oo, _ +0 p>) a 

20? (L(1,x) +0 (a-¥/?)) — eL(1/2,x) +0 (a4) +00) 

= 2L(1,x)e/? + O(1). 


I 


The second sum is 


1 n 
Lm Ll 


d<a1/2 w/2<n<a/d 
1 d}/2 1/4 
- Ya ((za2,0+0 (<2) - (L(1/2,x) + 0 (« ))) 
d<a1/2 " 
1 qi/2 4 
= | —1/4 
a 3 qi/2 (0 (<a) } O (x )) 
d<a1/2 
1 1 
«< + i/4 > q/2 
d<a1/2 
1/4 
< 1+ <q (2% +1) 
<I, 
Therefore, 


T(x) = 2L(1, x)x/? + O(1). 


However, we also have 


] 
T(x) > = 
for sufficiently large x, which is impossible if L(1, y) = 0. Therefore, L(1,y) 4 
0 for all nonprincipal real characters x. 


We can now prove Dirichlet’s theorem. 


Theorem 10.9 (Dirichlet) Let m and a be relatively prime positive in- 
tegers. For 1 <o <2, 


Sea aapee(sa7) 100 


p=a (mod ™m) 


348 10. Primes in Arithmetic Progressions 


In particular, there exist infinitely primes p such thatp=a (mod m). 


Proof. Let 1 < a < 2. Using the orthogonality relations for Dirich- 
let characters (Theorem 10.2) and the estimate (10.1) for log L(s, y) from 
Theorem 10.3, we obtain 


Sy x@betex) = SY yi kex), oa) 
x (mod m) x (modm) P E 
= ve > Xa)x(p) +011) 
Pp x (mod m) 
= om DY a +0(1) 


By Theorem 10.5, the term on the left corresponding to the principal char- 
acter Xo is 


(a) log £0, x0) = lox (>) + 000), 


oe 


1 
= log (4) + S- x(a) log L(o, x) + O(1). 
ao—1 
X#XO 
If y is a nonprincipal character modulo m, then L(1,x.) 4 0 by Theo- 
rem 10.7 and Theorem 10.8, and so log L(o, x) = O(1) for 1 < o < 2. This 
proves that 


5 2 Sum) loz (=) + O(1). 


Therefore, the series oe Gada) p° diverges as o > 1+, and so it must 
have infinitely many terms, that is, there must exist infinitely primes p such 
that p=a (mod m). This completes the proof of Dirichlet’s theorem. 


p=a_ (mod m) 


Finally, we obtain a generalization of Mertens’s theorem (Theorem 8.5) 
to sums of A(n)/n over an arithmetic progression. 


Theorem 10.10 Let m > 1 and a be relatively prime integers. Then 


S- HH) SEG). 


Se te EE) 


Proof. For the principal character yg we have 


Se — = loga + O(1) 


n<ux 
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by Lemma 10.1. For every nonprincipal character x modulo m, we have 
L(1,x) #0 by Theorems 10.7 and 10.8, and so 


A 
n 
n<ux 
by Lemma 10.2. Since xo(a) = 1, it follows that 


S> x(a) > x(n) = x(a) log x + O(1) = log x + O(1). 


x (mod m) nx 


On the other hand, by Theorem 10.2, 


y~ wey Aw _ paw Xax(n) 
x (mod m) n<a n<a x (mod m) 
= om yy 


n<ax 
n=a (mod m) 


This completes the proof. 


Exercises 


1. Let v4 be the nonprincipal character modulo 4. Prove that 


= 1 = 1 
a DD creme ae = 2 Ds iG A 


2. Let x3 be the nonprincipal character modulo 3. Prove that 
L(1, x3) = 2 3 — 
NSPS" £4 Bn + (BNF 2)" 


3. Let x be the Dirichlet character modulo 8 defined by x(3) = x(5) = 
—1. Show that 


= 85k + 32 
L(1,x)=2>5 


& (8k + 1)(8k + 3)(8k + 5)(8K +7) 


4. Let x1 be the real primitive character modulo 5. Prove that L(1, x) > 
0. Let x2 be the complex character modulo 5 defined by x2(2) = 2. 
Prove that the real and imaginary parts of L(1, v2) are positive. 
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5. Let m and a be relatively prime positive integers. Prove that 


logp _ logx 
ss ae + O(1). 


6. Prove that the set of all lattice points (n,d) such n and d are positive 
and nd < x can be partitioned into two disjoint sets as follows: The 
first set consists of all lattice points (n,d) such that 1 <n < y and 
1 <d<_2/n, and the second set consists of all lattice points (n, d) 
such that 1<d<a/yandy<n<_<«a/d. 


7. Compute the constant c such that 


1 
ye ge = 28? c+ Oe”). 
d<a 


Hint. Partial summation. 


10.5 Notes 


Our proof of Dirichlet’s theorem is “elementary” in the sense that it does 
not use complex analysis. Selberg [127] gave a different proof that is, he 
wrote, “more elementary in the respect that we do not use the complex 
characters mod k, and also in that we consider only finite sums.” 

Let m and a be relatively prime positive integers. We denote by (2; m, a) 
the number of prime numbers p < x such that p= a (mod m). By the 
prime number theorem, 


m 


T(x) = S- m(a;m, a) +301 ~ om. 
p|m 


a=1 

(a,m)=1 
The prime number theorem for arithmetic progressions states that for every 
integer m > 3 the prime numbers are uniformly distributed in the y(m) 
congruence classes relatively prime to m, that is, if (a,m) = (b,m) = 1, 
then 

m™(x;m, a) ~ 1(a;m, bd). 
Equivalently, if (a,m) = 1, then 

x 


T(x; m, a) ~ p(m) log a’ 


Selberg [129] also gave an elementary proof of this result. Granville [39] 
reviews elementary proofs of the prime number theorem for arithmetic 
progressions. For an analytic proof, see Davenport [21]. 
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For moduli m > 3 we can describe the comparative prime number race as 
follows. There are y(m) runners, one for each congruence class a relatively 
prime to m. For every positive integer x, the position of runner a (mod m) 
at time x is m(x;m,a). A runner wins the mod m race if it is eventually 
ahead of all the others. Does some congruence class win, or does the lead 
oscillate infinitely often between some or all of the competitors? In the 
case m = 4, Littlewood [94, 54] proved that m(#;4,1) — m(a#;4,3) changes 
sign infinitely often, so no class wins the “mod 4” race. More generally, 
we can ask the following question: Is it true that for every permutation 
@1,--+,@(m) Of the y(m) congruence classes relatively prime to m, we 
have 

T(x;m, a1) < W(x;M, a2) < +++ < A(z;M, ag(m)) 


for infinitely many integers x? This is an open problem in comparative 
prime number theory. For some results on this topic, see Turan [144]. 
In the Notes at the end of Chapter 9, we stated the Riemann hypothesis 
in the form 
a(x) = li(x) +O Ce) 


for every ¢ > 0. In Exercise 9 of Section 10.2 we constructed the meromor- 
phic continuation of the Riemann zeta function to the half-plane o > 0. 
We can now state the Riemann hypothesis in its usual form: If ¢(s) = 0 
with s =o + it and o > 0, then og = 1/2. 


Part III 


Three Problems in 
Additive Number Theory 


11 
Waring’s Problem 


11.1 Sums of Powers 


Lagrange proved that every number is the sum of four squares. This means 
that for every nonnegative integer n there exist nonnegative integers 2}, 
2, 13, L4 such that 

n= 27 +03+02 +23. 
Similarly, Wieferich proved that every number is the sum of nine cubes, 
that is, for every nonnegative integer n there exist nonnegative integers 
%1,---,29 such that 

n= aet+oate: +23. 
These are special cases of Waring’s problem, one of the most famous prob- 
lems in number theory. Waring’s problem states that for every integer k > 2 
there exists a number h such that every nonnegative integer can be written 
as the sum of exactly h kth powers. The smallest such integer h is usually 
denoted by g(k). Since 7 cannot be written as the sum of three squares, 
and 23 cannot be written as the sum of 8 cubes, we can restate Lagrange’s 
theorem as g(2) = 4, and Wieferich’s theorem as g(3) = 9. 

In 1909, the German mathematician David Hilbert proved Waring’s 
problem for all exponents k. The British mathematicians G. H. Hardy and 
J. E. Littlewood subsequently devised a different proof, and their method 
was simplified and improved by the Soviet mathematician I. M. Vinogradov. 
These proofs involve sophisticated techniques of real and complex analy- 
sis, even though the statement of Waring’s problem is purely arithmetic. 
In 1943, another Soviet mathematician, Yu. V. Linnik, devised a proof of 
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Waring’s problem that uses only elementary number theory. In this and 
the following chapter we give Linnik’s proof of Waring’s problem. 

There is a natural generalization of Waring’s problem to polynomials. 
Let f(x) be a polynomial of degree k that is integer-valued, that is, f(a) 
is an integer for every integer x. Every polynomial with integer coefficients 
is integer-valued. There are also polynomials with rational coefficients that 
are integer-valued. For example, the binomial polynomial 


Oe @ _ (a - UE aee ks 1) 


is integer-valued, and every integral linear combination of binomial poly- 
nomials is integer-valued. Moreover, every integer-valued polynomial f(z) 
of degree k can be expressed uniquely in the form 


k k 
x 
f(2) =D ui(e) = ous(4), 
i=0 i=0 
where uo, t1,---,Uz are integers and uz 4 0 (by Exercise 4). This is the 


standard representation of an integer-valued polynomial. 

If f(a) is an integer-valued polynomial of degree k > 1 with positive 
leading coefficient, then there exists a nonnegative integer m such that 
f(m) > 0 and f(z) is strictly increasing for x > m. Let fm(x) = f(a+m). 
Then f(x) is an integer-valued polynomial such that 


A(fm) = thm) bo 


is a strictly increasing sequence of nonnegative integers. The polynomials 
f(a) and f(x) have the same degrees and the same leading coefficients 
(by Exercise 9). Replacing f(x) with fm(x), we can assume that f(a) is an 
integer-valued polynomial such that 


A(f) = {f@}E0 


is a strictly increasing sequence of nonnegative integers. 

Waring’s problem for polynomials states that if the greatest common 
divisor of the set A(f) is 1, then every sufficiently large integer can be 
written as the sum of a bounded number of elements of A(f). If also 0,1 € 
A(f), then there exists an integer h such that every nonnegative integer can 
be written as the sum of exactly h elements of A(f). The classical Waring’s 
problem is the special case f(a) = x*. We shall also prove Waring’s problem 
for polynomials by Linnik’s method. 

In the next chapter we obtain a generalization of Waring’s problem for 
finite sequences of polynomials. 


Exercises 


In this set of exercises we characterize integer-valued polynomials. 
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1. Define bo(a) = 1. For every integer k > 1, define the kth binomial 


polynomial 
=) 


k! 
Compute by(a) for k = 0,1,2,3. Prove that if k > 1 and n > 1, then 
by(—7n) = (—1)*b,(n +k—- 1). 


Prove that if f(x) is a polynomial of degree & with complex coeffi- 
cients, then there exist unique complex numbers uo, u1,..., UR with 
ux #0 such that 


f(z) JS ant) 2: su €e (11.1) 


i=0 i=0 
2. For any function f(x), define the difference operator 
Af(#) = flx+1) — f(a). 
Prove that Abo(x) = 0 and that 
Abs (x) = be-1(2) 
for all k > 1. If 


prove that 


3. A polynomial f(a) is called integer-valued if f(n) is an integer for 
every integer n, that is, if f(Z) C Z. Prove that b,(a) is an integer- 
valued polynomial of degree k for every k > 0. Prove that if uo, ui,..., 
Ux are integers and ux 4 0, then 

: x 
n= Yu(’) 
i=0 
is an integer-valued polynomial of degree k:. 


4. Let f(a) be a polynomial of degree k with complex coefficients. Prove 
that if f(x) is an integer for all sufficiently large integers x, then there 
exist unique integers ug, U1,..-,Ux with uz 4 0 such that 


f(x) = ar er 


i=0 
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Hint: Observe that if k > 1 and f(x) is integer-valued for all suffi- 
ciently large x, then Af(x) is also integer-valued for all sufficiently 
large x. Represent f(a) in the form (11.1) and use induction on k. 


. Let f(x) be a polynomial of degree k with complex coefficients. Prove 


that if f(a) is an integer for all sufficiently large integers x, then f (2) 
is an integer for all integers x. 


. Prove that if f(a) is an integer-valued polynomial of degree k with 


leading coefficient a,, then 


1 
|ax| > a 
k! 


. Let f(x) be an integer-valued polynomial, and define 


d= gced{f(x): 2 €No} 


and 

d' = ged{ f(x) : x € Z}. 
Let uo, U1,..., Ux be integers such that 

& x 
f(a) = bs bis (5) 
i=0 

Prove that 

d=d = (ug, Uiys:., Up) 


. Prove that if 


1=0 
then 
x ie x 
fle) = fee +1) =ue(Z) + ow tun (4), 
i=0 
Prove that 
gcd(uo, ui, ed ,Uk-1, Uk) 
= ged(uo + ur, U1 + Ue, ..., Uk—-1 + Ue, Uk): 


. Let f(a) be an integer-valued polynomial and let m € Z. We define 


the polynomial f,,(x) = f(a +m). Prove that f(x) and f(x) are 
polynomials of the same degree and with the same leading coefficient. 
Let A(f) = {f(d)}%2o-. Prove that ged(A(f)) = ged(A(fm)). 
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11.2 Stable Bases 


A set A of nonnegative integers is called a basis of order h if every positive 
integer can be written as the sum of exactly h elements of A. The set A 
is called a basis of finite order if A is a basis of order h for some h. For 
example, by Lagrange’s theorem the set of squares is a basis of order four. 
Waring’s problem states that for every k > 2, the set of nonnegative Ath 
powers is a basis of finite order. 

Let A = {a;}%2) be an infinite set of nonnegative integers such that 
ag < a, < @g:--. The counting function of A, denoted by A(n), counts the 
number of positive elements of A that do not exceed n, that is, 


A(n)= So 1. 


ajcA 
l<aj<n 


The Shnirel’man density of the set A is 


o(A) = int { “02) sn =1,2,...} 
n 
= sup {a 3) > o forall n=1,2...), 
n 


Then 0 < o(A) < 1 for every set A. If o(A) = a, then A(n) > an for every 
n> 1. 

Let B = {b;}%) be a set of nonnegative integers such that 0 = bp < bi < 
bo <-+-. We construct the subset Ap C A as follows: 


Ap = {av,}720- 


Then 
ao = Aby < Ab, < Abg <i '. 


For example, AN, = A. 

If the Shnirel’man density of B is positive, then Ag is called a subset 
of A of positive Shnirel’man density. The set A is called a stable basis if 
every subset of A of positive Shnirel’man density is a basis of finite order. 
Shnirel’man proved that the set of kth powers is a stable basis for every 
k > 1. We shall also prove this generalization of Waring’s problem. 

A set A of nonnegative integers is called an asymptotic basis of order 
h if every sufficiently large positive integer can be written as the sum of 
exactly h elements of A. We call A an asymptotic basis of finite order if A is 
an asymptotic basis of order h for some h. Let gcd(A) denote the greatest 
common divisor of the elements of the set A. If gcd(A) = d, then every sum 
of elements of A is divisible by d. It follows that the set A is an asymptotic 
basis only if ged(A) = 1. 
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The lower asymptotic density of the set A is 
A 
dz(A) = Himing { 22 SS 12 sx | : 
n 


Then 0 < dz(A) < 1 for every set A. Let B = {b;}% be a strictly in- 
creasing sequence of nonnegative integers. If the lower asymptotic density 
of B is positive, then the set Ag is called a subset of A of positive lower 
asymptotic density. An asymptotically stable basis is a set A that satisfies 
the following condition: If d,(B) > 0 and gcd(Ag) = d, then there exists 
an integer h = h(B) such that every sufficiently large multiple of d can be 
written as the sum of at most h elements of Ag. In particular, Ap is an 
asymptotic basis of finite order for every set B such that dz(B) > 0 and 
gcd(Ag) = 1. 

We shall also prove that the set of kth powers is an asymptotically stable 
basis for every k > 1. 


Exercises 


1. Let A be a set of nonnegative integers. Prove that if o(A) > 0, then 
1eEA. 


2. Let m > 2. Let A, be the set of all nonnegative integers a such that 
a =r (mod m). Compute the Shnirel’man density of A, and the 
lower asymptotic density of A, for r=0,1,...,m-—1. 


3. For k > 2, let A™) = {n* : n € No} be the set of the kth powers of 
the nonnegative integers. Compute the Shnirel’man density of A), 


A, Let Aloo) = ee At), where A) is the set of kth powers. Compute 
the Shnirel’man density of A‘), 


5. Let P be the set of prime numbers and let P’ = P U {1}. Compute 
the Shnirel’man density of P’. 


6. Recall that [x] denotes the integer part of the real number x. Let 
Lo = {{[logn] : n = 1,2,3,...}. Compute the Shnirel’man density of 
ine 


7. Compute the Shnirel’man density of the set Ly = {[nlogn] : n = 
ees es 


8. For 0 <a <1, let La = {[n®logn] : n = 1,2,3,...}. Compute the 
Shnirel’man density of the set Da. 


9. Let A = {a;}S2, be a set of positive integers with 1 = a, < ag < 
a3 <-:-+. Prove that o(A) > 0 if limsup,_,,,(@i41 — ai) < co. 
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10. Let A = {a;}S2, be a set of positive integers with 1 = a, < ag < 
a3 <-:-+. Prove that o(A) = 0 if limj..(ai41 — aj) = 00. 


11. Construct a set A = {a;}2 of positive integers such that o(A) > 0 
and lim sup,_,.,(@i¢1 — ai) = 00. 
12. Let A = {a;}%2) and B = {b;}%p be infinite sets of nonnegative 
integers with 
0 = ag <a, < a2 <-°-, 
0 bo < by < bg <+-, 


l| 


and counting functions A(n) and B(n), respectively. Let Ag(n) be 
the counting function of the set Ag = {ap, }?29. Prove that 


and 
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Let A and B be nonempty sets of integers. The sumset A+ B is the set 
consisting of all integers of the form a+ b, where a € A and b € B. The 
difference set A — B consists of all integers of the form a — b, where a € A 
and be B. 

If A,, Ao,..., Ap are h sets of integers, then 


Ay +Ag+-:++Apn 


denotes the sumset consisting of all integers of the form a, + a2 +--:+4n, 
where a; € A; fori =1,2,...,h. If A; = A for alli =1,2,...,h, we let 


hA=A+t+::-+A. 
a 
h times 


Then A is a basis of order h if No C AA, that is, if the sumset hA contains 
every nonnegative integer. The set A is an asymptotic basis of order h if 
hA contains every sufficiently large integer. 

Let A be a set of integers. If A contains every positive integer, then 
A(n) =n for all n > 1 and A has Shnirel’man density o(A) = 1.Ifn¢A 
for some n > 1, then A(n) < n—1 and 

A(n) 


1 
GAs ies Su. 
n nr 
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Thus, o(A) = 1 if and only if A contains every positive integer. 

Shnirel’man density is an important additive measure of the size of a 
set of integers. In particular, the set A is a basis of order h if and only if 
a(hA) = 1, and the set A is a basis of finite order if and only if o(hA) = 1 
for some h > 1. Shnirel’man made the simple but extraordinarily powerful 
discovery that if A is any set of integers that contains 0 and has positive 
Shnirel’man density, then A is a basis of finite order. It follows that. if 
a(A) = 0 but o(h1A) > 0 for some integer hi, then the sumset h,A is a 
basis of finite order, and so A is also a basis of finite order. This is a key 
idea in our proof of Waring’s problem. Although the set A“ of nonnegative 
kth powers has Shnirel’man density zero, we shall prove that there exists 
an integer h; such that the set h, A“) of all sums of hy nonnegative kth 
powers has positive Shnirel’man density. 


Lemma 11.1 Let A and B be sets of integers such that 0 € A and0 € B. 
If A(n) + B(n) > n, thenn€ A+B. 


Proof. If n € A, then n =n+0€ A+B. Similarly, if n € B, then 
n=0+nEA+4+B. 
Suppose that n ¢ AU B. Define sets A’ and B’ by 
A'={n-a:a€ A,l<a<n-l} 
and 
B’ = BN(1,n—-1]. 
Then |A’| = A(n), since n ¢ A, and |B’| = B(n), since n ¢ B. Moreover, 
A'UB' C [1,n— 1]. 
Since 
|A’| + |B] = A(n) + B(n) 2 1, 
it follows that 
ANB FO. 
Therefore, n —a = b for some a € A andb€ B,andson=a+bE A+B. 


Lemma 11.2 Let A and B be sets of integers such that0 € A and0 € B. 
If o(A) + 0(B) > 1, thenNo CA+B. 


Proof. We have 0=0+0€A+8B. If n> 1, then 
A(n) + B(n) > (o(A) + o(B))n > n, 


and Lemma 11.1 implies that n € A+ B. 
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Lemma 11.3 Let A be a set of integers such that 0 € A and o(A) > 1/2. 
Then A is a basis of order 2. 


Proof. This follows immediately from Lemma 11.2 with A = B. 


Theorem 11.1 (Shnirel’man) Let A and B be sets of integers such that 
0€ A and0€ B. Let o(A) =a and o(B) = B. Then 


o(A+ B) >at+6B-af. (11.2) 
Proof. Let n > 1. Let ag = 0 and let 
L<ay<-:-<ap<cn 


be the k = A(n) positive elements of A that do not exceed n. Since 0 € B, 
it follows that a; =a; +0¢€A+B fori=1,...,k. Fori=0,...,k—1, let 


L<bi <--+ <b, S aig. -—aj—-1 


be the r; = B(aj41 — a; — 1) positive integers in B that are less than 
Aji41 — Qj. Then 


a, <a, +b) < +++ <a; tb, < ai4qn 


and 
a,+b;€ A+B 


for 7 =1,...,7;. Let 
1<bh <-++ <b, <n— ay 


be the ry = B(n — ax) positive integers in B that do not exceed n — ay. 
Then 
ap < ap t+ by <+++< apt bp, <n 


and 
ap+b; € A+B 


for 7 =1,...,7rx. It follows that 


k 
(A+B)(n) > A(n)+ Sor; 


1=0 
k-1 
= A(n)+ >> Blaiy1 — a; — 1) + B(n— a) 
i=0 
k-1 
> A(n)+8> > (dizi — a; — 1) + B(n— ag) 


1i=0 
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A(n) + Bn — Bk 
B)A(n) + Bn 
B)an + Bn 
at 8 — a8)n, 


{=< 
T= 


IV 


( 
( 
( 


and so i 
(A+ BO) 4 4p — ap 


for all positive integers n. Therefore, 


(A+B) = int { SPO) 51,2, beat p—as, 


n 


This completes the proof. 


Inequality (11.2) can be expressed as follows: 
1—o(A+ B) < (1—a(A))(1 — o(B)). (11.3) 


We can generalize this inequality to the sum of any finite number of sets 
of integers. 


Theorem 11.2 Let h > 1, and let Aj,..., An be sets of integers with 
0€ A; fori =1,...,h. Then 


1=6(Ay 4-4 A,) = T= oA). 


= 


1 


Proof. This is by induction on h. Let o(A;) = a; for i = 1,...,h. For 
h =1, there is nothing to prove, and for h = 2 the inequality is equivalent 
to (11.3). 

Let h > 3, and assume that the theorem holds for h — 1 sets. Let 
Aj,...,Apn be h sets of integers such that 0 € A; for all 7. Let B = 
Ay +-+:+Ap_1. We have the induction hypothesis 


3 
Il 


h-1 
1 —o(B) =1—o0(Ar +--+ + An-1) < TP] (Q—o(Ad), 
and so 
1—o(A, +---+ An) = 1—o(B+ Ap) 
< (1-o(B)(1— o(An)) 
h-1 
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This completes the proof. 


Theorem 11.3 Let 0 <a< 1. There exists an integer h = h(a) such that 
if Ai,..., An are sets of nonnegative integers with 0 € A; and o(A;) >a 
for alli =1,...,h, then 


Ait tAy = Ne 


Proof. Since 0 < 1 —a < 1, there exists a positive integer h, such that 


Let h = 2hy, and let A;,..., Ap, be sets of nonnegative integers with 0 € A; 
and o(A;) > a for i = 1,...,h. We define A = Ay +---+ Ap, and B= 
An, +1 +:+::+ Agn,. By Theorem 11.2, 


hy 
o(A) =0(A, +--+ + Ap,) 2 1-[]G-o(A,)) 21-(1- a)" > 


i=l 


Nl re 


Similarly, 
o(B) = o(Ansa +++ Aan) 2 5: 
Applying Lemma 11.3, we obtain 
Ay +++++A, = A+ B= No. 


This completes the proof. 


Theorem 11.4 (Shnirel’man) Let A be a set of nonnegative integers 
such that 0 € A and o(A) >0. Then A is a basis of finite order. 


Proof. Let a = o(A). The result follows from Theorem 11.3 with A; = A 
fori=1,...,h(a). 


Theorem 11.5 Let A be a set of nonnegative integers with 0 € A such 
that o(h,A) > 0 for some positive integer hy. Then A is a basis of finite 
order. 


Proof. If o(h, A) > 0, then there exists an integer h2 such that h, A isa 
basis of order hg, that is, every nonnegative integer is a sum of h2 elements 
of h,A. Since 

ho(h, A) = (hiha)A, 


the set A is a basis of order h = hyho. 
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Theorem 11.6 Let B be a set of nonnegative integers with 0 € B and 
gcd(B) = 1. If dz (B) > 0, then B is an asymptotic basis of finite order. 


Proof. The set A = BU {1} has positive Shnirel’man density (by Ex- 
ercise 1), and so A is a basis of order h; for some positive integer hy. It 
follows that every nonnegative integer can be written in the form u + J, 
where 0 < 7 < hy, and u is a sum of hy — j elements of B. Since 0 € B, 


Wwe (hy —j)B ChB. 


If B is any set of relatively prime positive integers, then, by Theorem 1.16, 
there exists an integer no = no(B) such that every integer n > no can be 
represented as a sum of elements of B. Since 0 € B and gcd(B) = 1, there 
exists a positive integer hg such that 


No +79 EhyB 


for 7 = 0,1,...,h1. Let h = hy + ho. If n > no, then n — no > 0 and we 
can write n — no in the form u+ j, where u € hy B and 0 <j < hy. Then 


n=ut(no+j)€mB+heB=hB, 


and so B is an asymptotic basis of finite order. 


Theorem 11.7 Let B be a set of nonnegative integers with gcd(B) = d. 
If dz(B) > 0, then every sufficiently large multiple of d is the sum of a 
bounded number of elements of B. 


Proof. The set d~! x B = {b/d: b € B} consists of nonnegative integers, 
and 
A={0}Ud'*B 


is a set of nonnegative integers with 0 € A and gced(A) = 1. By Theo- 
rem 11.6, every sufficiently large integer can be represented as the sum of 
exactly h elements of A, and so every sufficiently large multiple of d can be 
represented as the sum of at most h elements of B. 


Exercises 


1. Let A be a set of nonnegative integers. Prove that o(A) > 0 if and 
only if 1 € A and dz (A) > 0. 


2. Let hy and hg be positive integers with hy < he, and let A bea 
nonempty set of integers. Prove that 


hy A+ hgA = (hy + ho)A. 
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Prove that 
hi A — hg A = (hy — ho)A 
if and only if |A| = 1. 


3. Let A be a set of nonnegative integers such that 0 € A and 


Prove that 
o(2A) > 50(A). 


Use this to give another proof of Theorem 11.4. 


4. Let A be a set of nonnegative integers such that 0 € A, A 4 {0}, and 
hA=(h+1)A for some positive integer h. 
(a) Prove that hA = ¢A for all > h. 


(b) Prove that hA is periodic, that is, there exists a positive integer 
m such that if b€ hA, then b+me€hA. 


(c) Let d = gced(A). Prove that hA ~ dx No, that is, the sumset hA 
eventually coincides with the set of all multiples of d. 


11.4 Waring’s Problem for Polynomials 
Let f(a) be an integer-valued polynomial of degree & such that 


A(f) = {fF }E0 


is a strictly increasing sequence of nonnegative integers. Let d be the great- 
est common divisor of A(f). By Exercises 5 and 7 in Section 11.1, the 
polynomial f(a) /d is also integer-valued of degree k, and the greatest com- 
mon divisor of A(f(a)/d) is 1. Without loss of generality, we can assume 
that f(a) is an integer-valued polynomial with gcd(A(f)) = 1. 

Let NSE denote “the number of solutions of the equation.” We define 
representation functions r ,,(n) and Ry,s(N) for the polynomial f(x) by 


rps(n) = NSE {f(v1) +---+f(vs) =n: 01,...,05 € No} 


and 
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Lemma 11.4 Let f(x) = iar asx’ be an integer-valued polynomial of 
degree k with leading coefficient a, > 0. Let 


Wap. of ae 
a*(f) = (lax i] +|@x 2| + + laol). (11.4) 
ak 
Ifa > ax*(f) ts an integer, then 
k 3 k 
ee iG (11.5) 
2 2 
If N is sufficiently large, then 
s/k 
1 /2N 
Rys(N = ‘ 11.6 
nal) > 3 (Se) (11.6) 
Proof. Since 
k Gk—-1 , Gk-2 ao 
= 1 | bee , 
SS ( cs Ant nx? | 
it follows for x > x*(f) that 
(2) fi at | Seat gy Oke ao 
anak Ant = px? anak 
< law-al , len2l | aol 
Ape Apx? anxk 
e< lek=1l + lana] +--+ + Jao! 
= aR 
_ af) 
2x 
eZ 1 
5 
This proves (11.5). 
If x1,...,%5 are integers such that 
; on \ VE 
wh) Sot (=) 
for 7 =1,...,s, then 
3ap@: N 
(SS fax 7: Se 


and 
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The number of integers in the interval 


(0. (22) "] 


6oe e(f)-1, 


3aKS 


1/k $ s/k 
rte) > ((2%) wn-1) 23 (4) 


for N sufficiently large. This proves (11.6). 


is greater than 


and so 


Lemma 11.5 Let f(x) = pa 
degree k such that 


A(f) = {f(t20 
is a strictly increasing sequence of nonnegative integers. Define x*(f) by (11.4) 


and let cor 
nif) = 


For N > N(f), if @1,...,U%5 are nonnegative integers with 


ajx’ be an integer-valued polynomial of 


(11.7) 


Yo flay) <N, 
j=l 
then 
0 <a; < (2kIN)'/* fOP 7 SH My ih e.5.8s 


Proof. Recall that k!a; > 1 by Exercise 6 in Section 11.1. If N > N(f) 
and a; > (2k!N)!/* > x*(f), then 


k 
ane; 


f(x;) > > kla,N > N, 


and so 


S~ f(ai) > Flas) > N 


i=1 


This completes the proof. 


A critical part of Linnik’s solution of Waring’s problem is the following 
result, which is a special case of Theorem 12.3. 
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Theorem 11.8 Let {s(k)}9°, be the sequence of integers defined recur- 
sively by s(1) = 1 and 
s(k) = 8k2los2s@—D] for k > 2. 
Letc>1 and P > 1. If 
k 
f(z) = SS a,x" 
i=0 
is an integer-valued polynomial of degree k such that 
|a;| < cP*-* fori=0,1,...,k, 
then for every integer n, 


nog i Xj=iflaj)=n with x; € \ 5 
{ and |x;|<cP for j =1,...,s(k) ie 


Proof. Let c= c; and f;(x) = f(x) for 7 =1,...,s(k) in Theorem 12.3. 


Theorem 11.9 Let f(x) = uae a;x’ be an integer-valued polynomial of 
degree k with a, > 0 and gcd(A(f)) = 1. Then A(f)U{0} is an asymptotic 
basis of finite order, that is, for some h and every sufficiently large integer n 
there exists a positive integer hn, <h and nonnegative integers x1,...,Xp 
such that 


n 


f(ai) +--+ +f (en,,) = 7. 


Proof. Define N(f) by (11.7), and let s = s(k) be the integer constructed 
in Theorem 11.8. Let W = sA(f) be the set consisting of all sums of s 
integers of the form f(x) with « € No. We shall prove that the sumset W 
has lower asymptotic density d,(W) > 0. 

Let W(N) be the counting function of W. Choose c > (2k!)!/* and choose 
N > N(f) sufficiently large that for P = N\/*, 


|a;| << cP*~* fori =0,1,...,k. 


Then 0 < az < c. By Lemma 11.5, if x1,...,”, are nonnegative integers 
such that )7_, f(vj) < N, then 


O<2;<(2kIN)* <cP for j=1,...,s. 


We get upper bounds for ry .(n) and Ry,s(N) as follows: If 0 <n < N, 
then 


rys(n) = NSE{f(ai)+---+ f(t.) =n: a; € No} 
< NSE {f(e1) +--+ + flats) =: [ay] < cP} 
a 
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by Theorem 11.8, and so 


Rps(N) = do rps(n) 


Kk W(N)P** 
W(N 
<Kk,c ( 7 ’ P*, 


We can apply Lemma 11.4 to obtain a lower bound for Ry,.(N). For N 
sufficiently large, 


LON ee. Tyan : 
Ry,s(N) > 5 Pols) hehe 


Therefore, 
W(N 
P® <hk,c Ry s(N) <k,ec (37) H Bee 
and so W(N)/N >>x,- 1. It follows that 


dr(sA(f)) = dz(W) > 0, 


and the result follows immediately from Theorem 11.7. 


Theorem 11.10 Let f(x) be an integer-valued polynomial of degree k with 
leading coefficient a, > 0. If 0,1 € A(f) = {f(x): a € No}, then A(f) is 
a basis of finite order. 


Proof. This is a consequence of Theorem 11.9. 


Theorem 11.11 (Waring—Hilbert) For every k > 2, the set of nonneg- 
ative kth powers is a basis of finite order. 


Proof. This is the special case of Theorem 11.10 applied to the polyno- 


mial f(«) =a". 


Theorem 11.12 Let f(x) be an integer-valued polynomial of degree k with 
leading coefficient a, > 0 and gcd(A(f)) = 1. Then A(f) U {0} is an 
asymptotically stable asymptotic basis of finite order. 
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Proof. This requires only minor modifications of the proof of Theo- 
rem 11.9. Let A(f) = {f(i)}?2o, and let B be a set of nonnegative integers 
of lower asymptotic density dz(B) = 6 > 0. Then 


Ap = {f(b):b€ Bh. 


Let s = s(k) be the integer constructed in Theorem 11.8. The sumset 
W, = sAg consists of all sums of s integers of the form f(b) with b € B. 
Let W,(N) be the counting function of the sumset W,. Let r\2)(n) denote 


the number of solutions of the equation 
f(b1) +++ + f(bs) =n 
with b;,...,6; € B, and let 


N 
RYO (N) = Dore (n). 
n=0 


We shall again compute upper and lower bounds for RY (n). 
Choose real numbers c > (2k!)!/* and N > N(f) such that for P = N1/*, 


la;| <eP*—* fori=1,...,k. 


By Theorem 11.8, we have the upper bound 


N N 
B B 
RO) = SOs Se ee 
Benoa Benoa 


<kc Wap(N)P%* 


Kis (“er pe 


for all sufficiently large N. 
To obtain a lower bound, we observe that the number of integers b € B 
such that 


1k 
ae ) (11.8) 


34,8 


r(f) <b< ( 


is 


‘ (2)") Bt (3) ()" Pe eee? 


for sufficiently large N. By Lemma 11.4, if b € B satisfies inequality (11.8), 
then 


0< fe) <~ 


ne 
Ss 
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and so Z 
RON) Se P*. 


It follows that We(N)/N > x,- 1, and so Wg = sAg has positive lower 
asymptotic density. The result now follows from Theorem 11.7. 


Theorem 11.13 Let f(x) be an integer-valued polynomial of degree k with 
leading coefficient a, > 0. If 0,1 € A(f) = {f(x): a € No}, then A(f) is 
a stable basis of finite order. 


Proof. This follows from Theorem 11.12. 


Theorem 11.14 (Waring-Shnirel’man) For every k > 2, the set of 
nonnegative kth powers is a stable basis of finite order and an asymptotically 
stable asymptotic basis of finite order. 


Proof. This follows from Theorem 11.12. 


Exercises 


1. Prove that every multiple of 6 can be written as the sum of a bounded 
number of integers of the form x(x — 1)(a# — 2) with x € No. 


2. Prove that for every k > 1 there is an integer h(k) such that every 
positive integer can be written as the sum of at most h(k) kth powers 
of odd numbers. 


11.5 Notes 


Nathanson’s Additive Number Theory: The Classical Bases [104] contains 
proofs of Lagrange’s theorem that every number is the sum of four squares, 
and Wieferich’s theorem that every number is the sum of nine cubes. A 
proof of Lagrange’s theorem that depends on the geometry of numbers 
appears in Nathanson [103]. Jacobi’s formula for the number of representa- 
tions of an integer as the sum of four squares is Theorem 14.4 in Chapter 14 
of this book. 

In 1909 Hilbert [66] gave the first proof of Waring’s problem for all expo- 
nents & > 2. Hardy and Littlewood [55, 56] developed a different method 
of proof and obtained an asymptotic formula for rz,,(n). Vinogradov [150] 
simplified and improved the circle method of Hardy and Littlewood, and 
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obtained new results on Waring’s problem. Nathanson’s book [104] gives 
Hilbert’s proof of Waring’s problem and also a proof of the Hardy—Littlewood 
asymptotic formula. Vaughan [148] is the standard reference on the circle 
method. 

This chapter contains Linnik’s elementary proof of Waring’s problem. 
Linnik [93] published this proof in 1943. An exposition of Linnik’s proof 
also appears in Khinchin [78]. Rieger [122] refined Linnik’s method to obtain 
an upper bound for the smallest integer g(k) such that every nonnegative 
integer is the sum of g(k) kth powers. This upper bound is much larger 
than the upper bound obtained by the circle method. 

Kamke [76] proved Waring’s problem for polynomials. Nechaev [109] has 
applied classical analytic techniques, that is, exponential sums and the 
circle method, to Waring’s problem for polynomials. Kuzel’ [86] observed 
that Linnik’s method for the classical Waring’s problem also applies to 
Waring’s problem for polynomials. 
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Sums of Sequences of Polynomials 


12.1 Sums and Differences of Weighted Sets 


In this chapter we complete our study of Waring’s problem by Linnik’s 
method. We shall derive a fundamental upper bound for the number of 
representations of an integer as a sum of polynomials. In Chapter 11 we 
applied a special case of this result to solve Waring’s problem for a single 
polynomial. In Section 12.4 we shall use the full strength of this upper 
bound to obtain a generalization of Waring’s problem to sequences of poly- 
nomials. 

We begin with the definition of a weighted set. A weighted set is a pair 
(A, wa), where A is a set and wa is a function (called the weight function) 
defined on A. In this chapter weighted sets are always finite sets of integers, 
and the range of the weight functions is the set of nonnegative integers, that 
is, wa(a) € No for all a € A. Thus, we can think of a weighted set as a 
set with multiplicities, that is, a set in which the element a occurs or is 
counted w(a) times. 

There are natural ways to generate weighted sets. If (A, wa) is a weighted 
set and A is a subset of A*, then we can define the weighted set (A*, wa«) 
by 


war(a) = { wa(a) ifae A, (12.1) 


0 iface A*\ A. 


Let (Ai, wa,),---,(An,wa,) be weighted sets. The product set A; x 
--» x A, consists of all htuples (a1,...,a,) with a; € A; fori = 1,...,h. 
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We define a weight function on the product set by 
WA,x-xAn(@1,+++,@n) = WA, (a1) +++ Wa, (Gn). 


Let f : Ay x--- x Ap — B be a function defined on the product set. We 


define a weight function wi? on B as follows: 


w\!)(b) —= S- WA, xx Ap (Q1, +++, On) 


We can think of wif)(b) as counting the weighted number of solutions of 
the equation f(a1,...,@p,) = b. 
For example, if A,,..., A, are weighted sets of integers, then the sumset 


S= Ay 4A), 


is the image of the function o(a1,...,a,) = a1 +--+: + a, defined on the 
weighted product set A, x --- x A,. The weight of an element s € S is 


wi? (s) = S- wa, (a1) -+- wa, (aa). 


If wa,(a;) = 1 for alli =1,...,h and a; € Aj, then w?)(s) is simply the 
number of representations of s in the form a; + --- + aa. Similarly, if we 
define 6 : Ay x Ap > A, — Ag by 6(a1, a2) = a — ag, then the difference 
set 

D= A, — Ap = {a1 — ag : ay € Ay, a2 € Ag} 


is a weighted set of integers such that the weight of d € D is 
wy @M= So wa, (awa, (a2): 
(a1,a2)€A1 x Ag 


ay —ag=d 


Let NSE denote “the number of solutions of the equation.” If f is a 
function from the product set A; x --- x A, into a set B, then 


f(ai,...,@n) =6 _ 
nse { with a; € A; fori=1,...,h Es Ss H 
If (Ai, wa,),---, (An, wa,,) are weighted sets with wy,(a;) = 1 for all i = 


1,...,h and a; € Aj, then 


(f) -_ f(ai,...,@n) =0 
WB (0) = NSE { with a; € A; fori =1,...,h 
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If wt is an upper bound for the weight function w4,, that is, if wa,(a;) < uw? 
for alli=1,...,h and a; € Aj, then 


w(b) = S- wa, (a1) ++ wa, (an) 


IA 
M 
& 
= 


(Aq ses apn)EALX:-XAp 
Faqs ap)=b 


— * oe. ay* f(a1,...,an) =6 
= UW, WN ee head : 


For brevity, we shall often refer to the weighted set (A,w,) as the 
weighted set A. 

Let A,, Az, and A3 be weighted sets. We can form the weighted sumsets 
S, = A, + Ag and Sp = Ag + A3, and from these the weighted sumsets 
S,+As3 and Ay +S. We also have the weighted sumset S = A, + A2+ A3. 
By the associativity of set addition we have S = S;+ A3 = A, 4+ S2 as sets. 
In fact, these sets are also equal as weighted sets, that is, for every s € S 
we have 


ws(s) = WS\+A3 (s) = WA\4+S2 (s). (12.2) 


This is a special case of the following theorem, which shows that weights 
constructed by composition of functions are well-defined. 


Theorem 12.1 For @ > 2, let h,ro,r,...,r¢ be integers such that 
0O=1719 <r << re=h. 


Let (Ai, wa,),---;(An, wa,) be weighted sets and let By,..., Be, and C be 
sets. Fori=1,...,2@, let 


fi : Ap,_441 Kee XK Ap, — B; 
be a function defined on the weighted product set A,,_,41 X-+:X A,,. Then 
fi induces a weight function wif?) on the set B;, and these weight functions 
determine a weight function on the product set By x --- x Be. Let 


g: By x-:-x Be C 


be a function defined on the weighted product set B, x --- x Be. Then g 
induces a weight function wl? on C’. Define the function 


fi: Arx-++x Anpoc 
by 
f(ai,---,@n) 


— g(filar, 3 ee fo(G@r141, hia prs )3 oe +) Fe(Gre_.41) a + Ory). 
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Then f induces a weight function wf? on C. For allc € C we have 


we (c) = we? (0), 


that is, 


S- WA, X- x Ap (@1, +++) Gn) 


(44 5-.-,4p,)EAYX*X AZ, 
f(a1,..-,apn)=e 


= S- WB, x--x B(b1,---, be). 


(by ,...,bg)€By X---X Bg 


Proof. This is a straightforward calculation. We have 


wi) (c) = S- WB xx By (b1,-++ , be) 


(by ,...,bg) EBL XX Bg 
g(b1,...,bg)=e 


- SS wg (1) WE? be) 


(by ,.--,bg)EByX---xX Bg 
g(b1,...,bg)=c 


TL 


= S- S- wa,(ai) | X-- 


(b1,...,bg)€ By X--- x Bg (a1,-..,4ry )EALXX Ary t= 


x ye II WA; (aj) 


(Opp _ ytd Org)EAry 441 XxX Ar, t=re_1tl 
Felarp_ 4 $1 reserg )=be 


= ys » 


(b1,.--,bg)EBy X---xX Bg (Ay s--Ory )EALX KAP 


g(b15.--,bg)=e f1(@15--ary )=b1 
h 


S- WA; (ai) 


(app pear argE Ary 141% X Ary t= 
Folary tis arg )=be 


h 
= S [[ 4.(a) 
(a1,...,a, )€ALX+X AR w=1 
I( Pi (ap sary Jes Se(@ry_y 41s arg) =e 


h 


= S- WA; (ai) 


(@1,;---,€p)EALX*XAQ I= 
f(az,-..,a;,)=e 


= S- WW Ag Seisety (Cin s2 5 Oi) 


(ay 5--,4,)EAYX KX AQ 
f(aq,...,a7, =e 
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= wc). 


This completes the proof. 


Lemma 12.1 Let B, and Bo be weighted sets of integers. Define the ad- 
dition map o : By x By > By, + Bo by o(b1, be) = b1 + bg and the difference 
maps 6; : B; x B; > B,— B, by 6;(b;, bf) = b; — bi for i = 1,2. Consider the 
weighted sumset S = B,+ Bz and the weighted difference sets D, = B,— By, 
and Dz = By — Bg. Then for all integers n, 


w9(n) < 5 (wSP 0) + wh). 


Proof. For i = 1,2 we have 
(64,04) € By x By bE Bi 
b; —bi =0 


To each 6; € B, there exists at most one bg € By such that b; + bo = n. 
Applying the elementary inequality 


(a? + y’) for z,y ER, 


Nl Re 


ry < 


we obtain 


wi)(n) = Swe, (b1) we, (b2) 


(by, get x Bo 
by +bg=n 


1 
< S- 9 (wa, (b)? + WBe (b2)”) 
(61,b2)€ By x Bg 
by +bg=n 
< bs wel)? + >> vat) 
bie Bi b2€ Bo 


1 
2 
1 6 6 
= 5 (wi? (0) 4: wh? (0)) 


This completes the proof. 


Lemma 12.2 Fort > 1, let By,..., Bot be weighted sets of integers, and 
let S be the weighted sumset 


S= By +--+ By 
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with weight function determined by the addition map a: B, x--- x Bot - 
By +--++ Bo. Fori=1,...,2', consider the weighted difference sets 


D; = 2°-1B; — 2°-1B; = 2*-1(B; — Bi) 
with weight functions defined by the maps 
6;: By x---x B, > Dj, 


6:(bi1, sey b; at) => (bi, + peounal + bj 2t-1) = (b; 9-144 + EN + b; at). 


Then for all integers n, 


wWOm<= Y w$?O. (12.3) 


Let B be a weighted set with weighted sumset S = 2'B and weighted differ- 
ence set D = 2'-!B —2'-!B. Then 


wo? (n) < w'5?(0) (12.4) 
for all integers n € S. 


Proof. The proof of (12.3) is by induction on t. The case t = 1 is 
Lemma 12.1. 
Let t > 2, and assume that the lemma holds for t — 1. Consider the 
weighted sumsets 
S, = By +---+ Bot-1 
and 
So = Bot-144 + vy + Bot 


(01) ) 


with weights ws,°’ and w ie , respectively, and the weighted difference sets 
T, = 8, — Sy 


and 
Tz = Sp — So 


with weights wy?) and ee 


, respectively. Since 
S=S)+4+ So, 
we can define an addition map a’: S; x Sz, — S. By Theorem 12.1, 


w¥(s) = w¥(s) 


for all s € S. (Indeed, Theorem 12.1 implies that all of the weight functions 
constructed in this proof are well-defined.) 
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By Lemma 12.1, 


wH(s) <5 (wf + w80)) 
for alls € S. Fori=1,...,2°, we define the weighted difference sets 
Bi = B; — B,. 
Then 
MN = 3-8, 

= (By t+ + By) — (Br +++ + By) 

= (Bi — By) +--+ + (Bgr-1 — Bge-1) 

= Byt+---+ Byer. 
Similarly, 


To = $2 — Sg = By-141 +e + Bh. 
For i=1,...,2°, we define the weighted difference sets 
Di =o" B27 BR! 


with weight functions wer, By induction, the lemma holds for sums of 


2*-! weighted sets. T herefore, we have 


gi-1 
A 1 6 
wy?) (0) = 9t-1 Se w's? (0) 
t=1 
and 
1 = (51) 
A ( 
wi)(0) < = S- wy!’ (0), 
j=2t-141 
and so 
1) (Ad) (Aa) 1S 6 
ws (n) < 5(w? 0) + why?” (0)) = = do wb? (0). 
i=1 
Since 
DD, =P 2B 


= 2*-?(B; — B;) — 2*-?(B; — B;) 
= 9t-1p. —9t-1p. 
= D;, 

it follows that 


Inequality (12.4) follows immediately from (12.3). 
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Exercises 

1. Let A = {0,1,3,4} be a weighted set with weight function w4(a) = 1 
for all a € A. Compute the weight functions of the weighted sumset 
2A and the weighted difference set A — A. 

2. Let A = {0,1,3,4} be a weighted set with weight function w4(a) = a 
for all a € A. Compute the weight functions of the weighted sumset 
2A and the weighted difference set A — A. 

3. Let A = {1,2,3,4,5} be a weighted set with w4(a) = 1 for alla € A. 
Define f : A— A by f(1) = f(2) = 3 and f(3) = f(4) = f(5) = 2. 
Compute wif Ya). 

4. Let (A, wa) be a weighted set, let f : A — B be a function, and let 
wi be the weight function induced on B by f. Prove that 

S- wala) = So wP0). 
acA beB 

5. Let A = {1,2,3,...,n} and let wy, be a weight function on A. Let 
S;, be the group of all permutations of A. If r € S,, then: A> A 
induces a weight function wy) on A. Prove that wy (a) = wa(a) for 
all7v € S, and a € A if and only if wa is a constant function. 

6. Prove that Theorem 12.1 implies equation (12.2). 

7. Let A be a weighted set. Prove the weighted set identity 

(A — A) — (A- A) = 2A-2A. 
8. Let A be a set of integers of cardinality k. Prove that 


ke+k 
Piper ee 
2 
and 
|A—A| < kh? —k+1. 
For every positive integer &, construct a set A such that |A] = &, 


|A+ A| = (k? + k)/2, and |A— Al =k? -—k+1. 


12.2 Linear and Quadratic Equations 


In this section we obtain upper bounds for certain linear and quadratic 
diophantine equations. 
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Lemma 12.3 Let Q > 1. Let uy,..., uz be relatively prime integers such 
that 
U = max{|ur|,...,lusl} <Q. 


For every integer m, 


UV, +++ + URUE = ™M \ z (k— 1)!3Q)* 


: 12.5 
with |v1|,..-, uz] < Q U (12.5) 


NSE { 


Equivalently, for i = 1,...,4 we can define the weighted sets A; = {vu € 
Z: |v| < Q} with weights w4,(v) = 1 for all v € A;. Let B be the range 


of the function f(v1,...,0%) = U1v1 + +++ + ugvg. The lemma asserts that 
w'y’(m) < (& — 1)(3Q)-1/U. 
If we choose any & — 1 numbers vj,...,v%—1, then there exists at most 


one number v, that satisfies the equation u,v; +---+ugv, = m. This gives 
the trivial upper bound (2Q + 1)*-! < (3Q)*-} for (12.5). A nontrivial 
assertion of the lemma is the denominator U in Q*~!/U. 

Proof. The proof is by induction on k. If k = 1, then gcd(ui) = 1 and 
U = |u,| = 1. The number of solutions of the equation ujv, = m with 
\ui| < Q is at most 


01(3Q)° 
7 
Let k = 2 and U = max{|uj|, |wo|} = luo]. If 


1= 


uyv, + Ugvg = ™M, (12.6) 


then 
uiv; =m _ (mod U). 


Since (uy, u2) = (ui, U) = 1, we have 
vy =u,'m (mod U). 


The number of integers v; in the congruence class u;'m (mod U) with 
|ui| < Q is at most 


2Q 3Q 


gtisg Tr (since U < Q). 


For each such integer v; there is at most one integer v2 that satisfies the 
linear equation (12.6). Therefore, 


3 
NSE {ujvj + ugvg = m_ with |v;|,|v2] < Q} < a 
Let k > 3, and assume that the lemma holds for & — 1. Let 


U = max{wuy,..., ur} = lug. 
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If u; = 0 fori =1,...,4—1, then 1 = (u,.. 
the number of solutions of (12.5) is at most 


(& — 1)1(3Q)** 
7 , 


: seis Up) = |wre| _ U, and 


(2Q + 1)*-! < (3Q)*"* < 


If u; 4 0 for some i < k — 1, then 


@= (Gijaciptpai) 2 1. 


In this case, we define 


/ 
U; — 
and 
/ / / U 
U" = max{|uj|,-.-,[U,_alt S a 
Then (w,...,u,_,) =1. Consider the linear equation 
Uyvt tee fb Up_yUp-1 = mM’. (12.7) 
By the induction hypothesis, 
U1vy +++ + Up—1UR—-1 = dm! 
NSE : 
{ with Mineo E43) OQ \ 
= NSE {(12.7) with |vi|,...,]vz—1| < Q} 
(k — 2)'(3Q)*? 


< 
< ia 
If the integer m’ can be represented in the form (12.7) with |v;| < Q, then 


Im'| < (k-)U'Q. 
Since (d,uzx) = (u1,..-.,Ug—-1,Uk) = 1 and max{d, |uxz|} = |ug| = U, it 


follows that 
UzVy +++ + URUE = ™M \ 


ee { with |vi|,..-, |v“) < Q 
Uyvy + +++ + Up—1UR—1 = dm! 
< NSE F 
~ { with [Bailes esa SO \ 
dm! + ugv_y, =m with 
«NSE { nh fate eG 


< (R= DIGQI  3&-HU'Q 
U’ U 

_ (k—1)!(3Q)*" 

— oe 


This completes the proof. 
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Theorem 12.2 Let k > 3 and let P,Q, and c be real numbers such that 
1<P<Q<cP*. 
Consider the quadratic equation 
U1V1 +--+ + URVE = 0 (12.8) 
in 2k variables uy,...,Ug,V1,;---,UR- Then 
Uyvy +--+ + URvE = 0 
NSE ¢ with |u;| < P and |v;|<Q > <k (PQ)*t. 
fori=1,...k 


Proof. If uj =--- = uz = 0, then the number of solutions of (12.8) with 
|u;| < Q is at most 


(2Q+1)* < (3Q)* =3Q(3Q)*” 
< eP**(3Q)*"* = 3*e(PQ)* 
<k,c eor 


Suppose that u; 4 0 for some 7. Then 
1<U =max{|u|,...,]ug|} < P. 


There exists a unique nonnegative integer m such that 


P 


om (12.9) 


The number of equations of the form (12.8) with |u;| < U < P/2™ does 
not exceed fe ‘ 

2P re é 3P 

Qm = Qm 7 


(Ui,s--5 UE) = 1, 


If 
then by Lemma 12.3, the number of solutions of each such equation with 
|u;| < Q is at most 


(k—1)4(3Q)*"* | (k—1)lamt1(3Q)*-1 
U = P 


Therefore, the number of solutions of all equations (12.8) with (wi,...,ug) = 
1 and U in the interval (12.9) is less than 


(k —1)12"+1(3Q)F-1 /3P\* — 6(k—1)'(9PQ)*-1 
P (= = 9(k-1)m . 
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Summing over m, we obtain 


Uyvy +--+ + UupvE = 0 oo 6(k 
NSE ¢ with |u| < P,|vi| < Q, < sy 
and (t1,...,Ur) =1 m= 


8(k — 1)\(9PQ)*- 


If (wi,...,ux) = d, we define ui = u;/d for i = 1,...,k. The integers 
uj,..., uj, are relatively prime, and |u/| < P/d. The integers v1,..., vz are 
a solution of equation (12.8) with |u,;| < P if and only if (v1,...,v%) isa 
solution of the equation 


(9PQ)*- 
a 1)m 


Oo 


IA 


uy te + u,v, =O — with |ui| < P/d. 
Therefore, 


Uuyvy te++ + upvE = 0 P k-1 
NSE ¢ with |u| < P, |u| < Q, 8(k — 1)! € (3) a) 
d 
and (u1,...,Un) =d 


A 


8(k — 1)\(9PQ)k-} 
qk-1 


For k > 3 we have 
ae. Va © dx k-1 
14 = <2. 
ye = / get ~ 2 > 


Summing over d, we obtain 


UV, ++++ + Upvp =O oo pee 
8(k — 1)K(9P. 

NSE ¢ with jul <P,lul<Q, > < > ( AG Q) 

and u; 4 0 for some i d=1 

< 16(k—1)\(9PQ)*-' 
Therefore, 
UjVy + +++ + UVR = 0 
NSE { with |u;| < P and |v;| < Q \ 


<  3*c(PQ)*-! +. 16(k — 1)!(9PQ)*-1 
Soe: (PO, 


This completes the proof. 


12.3 An Upper Bound for Representations 387 


Exercises 
1. Find all solutions of the linear diophantine equation 
6v, + 10ve + 15v3 = 0 with |v1|, |v2|,|v3| < 10. 


Compare the number of solutions with the upper bound obtained 
from Lemma 12.3. 


2. Find all solutions of the linear diophantine equation 


6v, + 10ve + 15v3 = 1 with |v1|, |v2|,|v3| < 10. 


3. Find all solutions of the quadratic equation 
U,V, + U2v2 + u3v3 = 0 


with |u;| < 1 and |u;| < 1 for i = 1,2,3. Compare the number of 
solutions with the upper bound obtained from Theorem 12.2. 


12.3. An Upper Bound for Representations 


We can now prove Theorem 12.3, which gives the fundamental upper bound 
for the number of representations of an integer as the sum of a bounded 
number of values of polynomials of degree k. We need the following standard 
result about polynomials. 


Lemma 12.4 Let 
k 
f(z) = > ax’ 
i=0 
be a polynomial of degree k with complex coefficients. Then 


f@+u) — f(#) = ugu(2), 


where 


is a polynomial of degree k — 1 with coefficients 


k ; 
ai(u) = (7)aju 
j=itl 
For any positive number P, if 
lal < aP, 
|| < 2c, P, 
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and . 
las|<cP*-* for i=0,1,...,k, 
then . 
lai(u)| < c(4c,)*kP*-*-* fori =0,1,...,k—-1 
and 


Igu(x)| < e(2c1)**k? PE 


Proof. This is a purely formal calculation. We have 


k 
f(atu)—f(xz) = S/ a;(a + uy Loa 
j=0 
- j-1 
= Soa (Zaha ‘ 
j=l i= 


0 
k 
| De 


()a ys t—-1 a 
i=0 j=it+l 


= UGy(2). 


If |a;| < cP*~* and |u| < 2c,P, then 


(12.10) 


k : k 
J paG ; = eo 
lau) < SO (7) lalla << S> 2%eP*4(2c, Pi 
j=it1 j=i4+l 
< c(4c,)*kP*-1-*, 


If also |a| < cP, then 


gu(2)| aj(u)]|2|" 


M4 


IA 


c(4c,)*kP*-1-*(e, P)? 


IA 


26) * RPE. 


This completes the proof. 


Theorem 12.3 Let {s(k)}?°2, be the sequence of integers defined recur- 


sively by s(1) = 1 and 


s(k) = 8k2les2s(@-1] for k > 2. 


(12.11) 
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Letc>1. For j =1,...,s(k), let 


k 
fie) =S- age 
1=0 


be a sequence of polynomials with complex coefficients such that 
lanj|< ec forj=1,...,s(k). 
Choose P > 1 such that 
la;;|<cP** fori=0,1,...,k—1 andj =1,...,8(k). (12.12) 
Let cy > 1. For every complex number z, 


3 fi(2j)=2 with a eZ 


NSE 
{ and |x;| <P for j =1,...,s8(k) 


\ Regge tPeP-F. (913) 


Proof. The proof is by induction on the degree k& of the polynomials. 
For k = 1 we have s(1) = 1 and f(a) = ay,” + ag. For any number z, 
there exists at most one integer x; such that f;(71) = z, and so 


f(a) =z with LE Z _ ps(1)-1 
Ne <1=P 


Let & > 2, and assume that the theorem holds for s’ = s(k — 1) polyno- 
mials of degree k — 1. Define 


t = t(k) = [logy s‘] +2 


and 
s = 8(k) = 2k2* = gKgilose s(k—-D) 


Since [a] < a < [x] + 1 for every real number 2, we have 
gl = 2082 3! < gllog. s‘J+1 _ gt-1. 
Consider the weighted set (X,wx)), where 
X={r@eEZ: |x| < cP} 
and wx(a#) = 1 for all x € X. For j = 1,...,5 we have the weighted sets 
Fy = {fj(@) se € X} = {fj(x) : |e] S eo P} 


with weights 


w0!))(z) = NSE {f;(x) = 2: |x| < ex P}. 
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Let S be the weighted sumset 
S=F,4+---4+ F;. 


Then 
ws(z) = NSE Sen =z with |zj;|<aP 
j=l 
For g 
=_=k?2' 
M=>5 k2", 


we consider the weighted sumsets 
Bi =F,+-:-+ Fn 
and 
Bo = Pm4it::+t Fam, 
and the weighted difference sets 


m 


Dy = By - By = (f(y) — fila) = esl lug] < oP 
j=l 
and 
2m 
Dz = Ba-Be= 4 >> (f(y;) — flay) = esl, lug] < oP 
j=m+1 


Applying Lemma 12.1 to S = B, + Bo, we obtain 


For j = 1,...,s, let 


f(a + u) — fi(@) = ugju(2), 
where gj,.(x) is the polynomial of degree k—1 constructed in Lemma 12.4. 
We can use our result on quadratic equations and weighted sets (Theo- 
rem 12.2) to obtain upper bounds for the weights wp,(0) and wp,(0). If 
\x;|,|yj;| <P and u; = y; — a;, then |u,| < |x,| + |y;| < 2c1P. It follows 
that 
dja Fi (ys) — Fy(a;)) = 0 
= NSE J=1\JINAI INT. 
wo) = NSE | Sine h tl Sa? 
NSE ei (Fi (ay + uy) — fi(aj)) = 0 
with |;| <c¢iP and |u;| < 2c;P 


= NSE { ae Uj9j,u;(€j) = 0 


with |x;| <c¢iP and |u;| < 2c;P 


= NSE { Dia YiGjuj(@j) =0 with 1 


x;|<c,P for 7 =1,...,m 
Laval poss, Petes | See leg] Se ors ered 


IA 
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Similarly, 


2m F 

pie Ug gear e) = 0 with 
wp,(0) < NSE { Dojeme1 U5 4 
; ju 2 |<2e1P |x;| <a P for j =m+1,...,2m 


For j = 1,...,m, we fix integers u; with |u;| < 2c;P, and consider the 
weighted sets 
G5 = {9j.u, (2) + |2] < er P} 
and 
Gi = Uy * {9j,u5(@) + 2] S xP} = {Uj 9j,u;(@) + |2| < er PH, 
with weights 
wa, (z) = wat (ujz) = NSE {9j,u,(a) = 2: |2| < 1 PH. 


Recall that m = k2'. For q=1,...,2', we define the weighted sets 


By = “Goa Gg aye bo + Gg 
/ = t—lpl t-lpi 
Dy sO Bh ote, 
and 
m at 
$= 326,- 575% 
j=l q=l 
Then 


ws: (0) = NSE SS ia (z;) =0 with |x,;|<aP 


j=l 


By Lemma 12.2, 


We can express the difference set Di, as follows: 


/ _ t-lpi t-lpl 
De 0 pear tg! 
k k 
2 e Gq-1ykte ~ ar y Glq-1)k+r 


r=1 r=1 


k 
= Do teense * (2° 7G q—1yetr — 2G q—ayesr) 
r=1 


k 
= tases Vere 


r=1 
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where 
Viq—1)k+r = 2°" Gq—1)k+r — OG Gy Aykae 


Let v € Viq-1)k+r- By Lemma 12.4, if |x| < cP, then 
I9(q—1)k+ riq—1yn¢r (2) < ee) th PR, 


and so 
|u| < c(2c,)?*k?2' PR}, (12.14) 


We shall use the induction hypothesis for polynomials of degree k — 1 to 
obtain an upper bound for the weight of v. Let 


Ju(2) = Iq—-1Wk-+ry¢qg—1)e4r (2) = a;(u)x . 


By Lemma 12.4, we have 


|a(u)| < c(4c,)*¥kP*-1-* 


fori =0,1,...,4 —1. Since s’ = s(k — 1), for every number z’ we have 
NSE 2 Dja19(23) = 2 Kise ey PY, 
with |z;|<ciP for j =1,...,5 oe 


Since s’ < 2'—!, we obtain the following upper bound for the weight of v: 
v= gr Gul@a) — gna Gul) 

WViq—rye+r(v) = NSE beri dee eee t-1 
with |z,|,|2,| < ai P for g=1,...,2 


a Ju(%q) = ‘ 
ps 


= anc} 2 
= NSE ut get Ju(X4) a aac Ju(Xq) 
with: |oel;|eg|S GP for g = Toc 2" > 


= S- NSE ae Ju(X4) = Daan Gil Za) 


if , . 
qlee yy ah with |vg| <ciP for g=1,...,8’ 
Bog yleslege—11Se1 P 
» s’—(k-1 
<k,c,c1 P ( ) 
Bh lies loreal 
I@ ora qhers l@o¢—11Se1P 


tof ee | 
ere P? s ps + 


t 
le cs P? k+l 


Therefore, there exists a constant c’ = c(k,c,c,) such that wy,(v) < 
c P2'-k+1 for all j =1,...,mand ve Vj. 
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Let U be the weighted set of all integers wu such that |u| < 2c;P and 
wyu(u) = 1 for all u € U. Let V be the weighted set of all integers v that 
satisfy inequality (12.14) and have constant weight wy(v) = cP2’—**1, We 
can now find an upper bound for the weights wp, (0) and wp, (0): 


wp, (0) < Ss ws? (0) 
(iil ssccfeiea ea P 


IA 
NI) 
M 

€ 
& 

Ss 


1 
< pares 
a at 
Juil,.., oes 
k 
2 > (¢P%-)" NSE { drat U(q—lk+rU(q—-Dk+r = 9 \ 
o= with U(q—1)k+r € Veq—1)k+r 
pm-k? +k 
<Kk,c.c1 a 
=0 
x NSE { Dorat Ugh rMg—1yk+r = \ 
Se with Uq—-ktr © V 
pink? +k 
t k 
7 ee U(q—1)k4+rU(q-1)k+r = 0 
x SSO NSE ¢ with o¢-1)k4, € V and 
q=1 wa one U(q—1)k+19++ +> Ugk EU 
pm-k? +k pm—k 
Kk,c,c1 gt 
t k 
2 pee U(q—1)k+rU(q—1)k+r = 0 
x S "NSE with vg—1)k4+r © Viq—1)k+r and 
q=l U(q—-1)k+19 +++» Ugk © U 
k 
C,c P®- NSE Lir=t CES 0 
SMoiey with v, € V and u, € U 
<Kk,cc per (pees (by Theorem 12.2) 
Kk,c,c1 pret 
Similarly, 
wp, (0) <Kkyc C1 P a 
Therefore, 


(wp, (0) + wp,(0)) kee, P&*. 
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This completes the proof. 


Exercises 
1. Compute s(k) for k = 1,2,3,4,5. 


2. Prove that 
A¥-1k! < s(k) < 8°~1k! 


for k > 2. 


12.4 Waring’s Problem for Sequences of 
Polynomials 


In Chapter 11 we applied a special case of Theorem 12.3 to prove Waring’s 
problem for a polynomial. In this section we show how the full strength 
of Theorem 12.3 yields a generalization of Waring’s problem to finite se- 
quences of polynomials. Let c > 1. For j = 1,...,s, let f;(x) be an integer- 
valued polynomial of degree k whose leading coefficient a,; satisfies the 
inequality 0 < ax; < c. We consider the sequence 


F={fi(@) tja1- 


We shall prove that there exist integers s(k) and h(k) and a positive number 
6(k,c) such that if s > s(k), then the set 


Sa (filer) fel@s)t Big ranges S Noy 


has lower asymptotic density dz(S) > 6(k,c) > 0, and if s > h(k), then S 
is eventually coincides with a union of congruence classes. 
We define the representation functions r¢(n) and R¢(N) by 


r¢(n) =NSE { fila) +--+ falas) = 7 


with H1,+++,h5 E No 


and 
Re(N)= So re(n). 
0<n<N 
Lemma 12.5 Let c > 1. Let F = {fj(x)}5_1 be a sequence of integer- 
valued polynomials of degree k, and let ax; be the leading coefficient of 
f(z). We assume that 
0 <apj <e 
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for j=1,...,s. If N is sufficiently large, then 


s/k 
~~) (12.15) 


Rr(N) > 5 (= 


Proof. Define x*(f;) by (11.4) for 7 =1,...,s. If the integers x, satisfy 
the inequalities 


a" (fj) <2jS ( 


then, by Lemma 11.4, 


3dp x 3c (2N N 
Ds ey ft < S(E)-4 
and 
O< fil(vi) +--- + f(a) < N 
Therefore, 


for N sufficiently large. This proves (12.15). 


Lemma 12.6 Let F = {fj(x)}}_1 be a sequence of integer-valued polyno- 
mials of degree k, and let ay; be the leading coefficient of f;(x). Let c > 1. 
We assume that 


0 < apy Se 
and that A(f;) = {f;(x) : « € No} is a strictly increasing sequence of 
nonnegative integers for j =1,...,8. There exists a number Ni(F) such 
that if N > Ni(F) and a1,...,v, are nonnegative integers with 
s 
Yo fla) <N, 
j=l 
then 


a; < (4kIN)'/* for j =1,...,8. 


Proof. The proof is the same as the proof of Lemma 11.5. Recall that 
klay; = 1 by Exercise 6 in Section 11.1. Define x*(f;) by (11.4) for j = 
1,...,8, and 2*(F) = max{a*(fi),...,a*(fs)}. Let 


u*(F)* 


MN oe 


(12.16) 
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If N > Ni(F),1< <8, and 2 > (2k!N)/* > 2*(F) > x*(fe), then 


k 
felae) = “MEE > hlawN > N, 


and so : 
S~f(2;) = flee) = flee) > N. 
j=l 
It follows that if x1,...,2, are nonnegative integers such that 
doses) SN, 
j=l 
then 


xy < (2kIN)i/* for 7 =1,...,8. 


This completes the proof. 


Theorem 12.4 For any positive integer k and real number c > 1, there 
exists a number 6(k,c) > 0 with the following property: If s = s(k) is the 
integer defined by (12.11), and if F ={f;(x)}f—1 ts a sequence of integer- 
valued polynomials of degree k whose leading coefficients ay; satisfy 


0 < apy <c, 
then the sumset 
B={fi(ti)+---+ fs(@s) 2 @1,...,25 € No} 
has lower asymptotic density 
dz(B) > 6(k,c) > 0. 


Proof. Replacing the polynomial f;(x) with f;(a+ xo) for a sufficiently 
large integer xo, we can assume that {f;(x) :  € No} is a strictly increasing 
sequence of nonnegative integers for 7 = 1,...,s. 

Define Ni(F) by (12.16). Choose N2(F) sufficiently large that for N > 
N2(F) and P = N\/*, we have 


laij| < cP*~* fori=0,1,...,k—-1, 
and so Theorem 12.3 applies to the polynomials in the sequence F. 
Let N(F) = max{Ni(F), No(F)) and c, = (2k!)!/*. By Lemma 12.6, if 


N > N(F) and 21,...,2, are nonnegative integers such that 


filei) toot felt) SN, 
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then 7; <c,P for 7 =1,...,s. Therefore, if0 <n < N, then 


- fam) +--+ folos) =n 
re(n) = NSE { oan EN for j = 1,...,8(k) 


S nse { AG st ieen elle 


with 0 <2; <c,P for j =1,. 


Sis ps-k 


by Theorem 12.3. Let B(n) be the counting function of the set B. We have 


N N 

RN) = S- rF(n) = a rz(n) 
n=0 n=0 

Ke Biw)prt = BAP 


By Lemma 12.5, 
s/k 
1/2 
naw) > 3 (2) 


3cs 


It follows that B(N)/N > ,- 1. This completes the proof. 


We say that sets of integers A and B eventually coincide if there exists 
a number no such that n € A if and only if n € B for all n > no. By 
Theorem 12.4, the set of sums of s(k) integer-valued polynomials of degree 
k; has positive lower asymptotic density, but not necessarily a rich arith- 
metic structure. For example, sets of positive density can have arbitrarily 
large gaps between consecutive elements. We shall prove that there exists 
a number h = h(k,c) such that the set of sums of h(k,c) integer-valued 
polynomials of degree & with positive leading coefficients not exceeding c 
has bounded gaps, and, moreover, eventually coincides with a union of con- 
gruence classes. The proof of this result requires a deus ex machina in the 
form of a theorem of Kneser on the asymptotic density of sumsets. We do 
not prove Kneser’s theorem in this book, but this application of Kneser’s 
theorem gives a generalization of Waring’s problem that is too beautiful to 
resist. 

For i =1,...,d, let B; be a set of integers with lower asymptotic density 
dr (B;) = Gi, and let S = By +---+ By. Kneser’s theorem states that if 
dzi(S) < 6,+---+ Ga, then there is a modulus m > 1 such that the sumset 
S eventually coincides with a union of congruence classes modulo m. 


Theorem 12.5 Let k be a positive integer andc > 1. There exists a posi- 
tive integer h = h(k,c) with the following property: Let F = {f;(x)}'_, be 
a sequence of integer-valued polynomials of degree k such that the leading 
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coefficient az; of f;(x) satisfies the inequality 0 < ap; <c¢ forj =1,...,h. 
There exists a positive integer m such that the sumset 


S={filti)+---+ faltn): 2; No forj =1,...,h} 
eventually coincides with a union of congruence classes modulo m. 


Proof. Let s = s(k) be the positive integer constructed in Theorem 12.3 
and let 6 = 6(k,c) be the positive number constructed in Theorem 12.4. 


We define 
1 
d= |= 1 
5] ss 


and 
h = h(k,c) =ds. 


Let F = {f;(z) Bes be a sequence of integer-valued polynomials of degree 
k whose leading coefficients are positive and not greater than c. For i = 
1,...,d, let Fy = {fix—1)s45(£)}fa1- By Theorem 12.4, the sumset 


B; => S> fa—1)s+5 (23) : Xj € No 


j=l 

has lower asymptotic density dz(B;) > 6 > 0. Since 
j=l 

and 


Ya (B) > dd=6 ( + 1) >1>d,(S), 


Kneser’s theorem implies that S' eventually coincides with a union of con- 
gruence classes modulo m for some positive integer m. 


12.5 Notes 


This proof, so exquisitely elementary, will undoubtedly seem 
very complicated to you. But it will take you only two to three 
weeks’ work with pencil and paper to understand and digest it 
completely. It is by conquering difficulties of just this sort, that 
the mathematician grows and develops. 
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A. Ya. Khinchin [78] 


The proof to which Khinchin refers is Linnik’s elementary proof of War- 
ing’s problem. It is the “third pearl” in Khinchin’s famous book Three 
Pearls of Number Theory [78]. The quotation is the last paragraph in the 
book. 

Theorem 12.3 generalizes a result of Linnik for sums of one polynomial 
to sums of a sequence of polynomials. Linnik’s result provides the essential 
upper bound in his solution of Waring’s problem. 

Often, theorems in number theory and, in particular, variants of Waring’s 
problem, are first proved analytically, and only later are elementary proofs 
discovered. Theorem 12.4, due to Nathanson, is an unusual example of a 
result that was first proved by elementary methods. 

For a proof of Kneser’s theorem [79] on the asymptotic density of sumsets, 
see Halberstam and Roth [48] and Nathanson [108]. 


13 
Liouville’s Identity 


13.1 A Miraculous Formula 


In a series of eighteen papers published between 1858 and 1865, Liouville 
introduced a strange and powerful method into elementary number theory. 
In this chapter we prove an important identity of Liouville. We shall apply 
it in Chapter 14 to obtain theorems about the number of representations 
of an integer as a sum of an even number of squares. This is our second 
problem in additive number theory. 

Recall that a function f(x) is called even if f(—x) = f(x) for all a. A 
function f(x) is called odd if f(—x) = —f(x) for all x. If f(a) is odd, then 
f(0) = —f(0), and so f(0) = 0. 

The function F(z, y, z) is odd in the variable z if F(—2a, y, z) = —F(2,y, 2), 
and even in the pair of variables (y,z) if F(x,—-y,-z) = F(a,y,z). If 
F(a,y,z) is odd in the variable y and also odd in the variable z, then 
F (x,y,z) is even in the pair of variables (y, z). For example, the function 
F (x,y,z) = xyz is odd in the variable x and even in the pair of variables 
(y, 2). 

In this and the following chapter, u,v, and w denote integers, and d, 6, 
and @ denote positive integers. The notation 


og 


uz+d6=n 
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means the sum over all ordered triples (u,d,6) such that u? + dé = n. For 
example, 


S> Gud) = G(0,1,3) + G(0,3,1) + GQ, 1,2) 
7? fb =8 
+ G(1, 2,1) + G(—1, 1,2) + G(-1, 2,1). 


We define the symbol {T(€)}n—¢2 as follows: 


_ Jj 0 if n is not a square, 
{TO }n=e = { T(@) if nis a square and n= @?. 


Liouville’s fundamental identity is the following. 


Theorem 13.1 (Liouville) Let F(x,y,z) be a function defined on the set 
of all triples (x,y,z) of integers such that F(x, y,z) is odd in the variable 
x and even in the pair of variables (y,z). For every positive integer n, 


2 S° F(6-2u,u+d,2u+ 2d — 6) 


u2+db=n 
= YS) F(d+6,u,d—6) + {2Ti(0) —Te(O}n=0, 
uz+d6=n 
where 
20-1 
Tif) = >| F245) 
j=l 
and 
é-1 
T(é)= >| F(20,9,25)- 
j=—f+1 


For example, there are six triples (u, d, 6) such that u?+dé = 3. Liouville’s 
formula for n = 3 asserts that 
2(F(3,1,-1) + F(1,3,5) + F(0, 2,2) + F(-1, 3,5) + F(4,0,-2) + F(3,1,1)) 
= F(4,0,2) + F(4,0,—-2) + F(3,1,1) + F(3,1,-1) + F(3,—-1,1) 
Oe i en ea 


It is easy to check this identity using only the parity properties of the 
function F(a, y, z). 

We shall prove Theorem 13.1 in Section 13.4. 

Liouville’s identity is very general, and we can specialize it in many ways. 
Here is an example. 


Theorem 13.2 Let f(y) be an odd function. For every positive integer n, 


SS) (-1)"f(ut d) = {(-1)* 4F(O)},_p- 


u24+d6=n 
6=1 (mod 2) 
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Proof. We define the function 


Feana={ if x or z is even, 
a (—1)@+2)/2 f(y) if a and z are odd. 
Then F(a, y, z) is an odd function of each of the variables x, y, and z, hence 
an even function of the pair of variables (y, z). If x,y, z are integers and 6 
is even, then 6 — 2x is even, and so F'(6 — 2a, y,z) = 0. 

We shall apply Theorem 13.1 to the function F(z, y, z). The left side of 
Liouville’s identity is 


2 So F(6—2u,u+d,2u + 2d — 6) 
u2+dé=n 
= 2 So F(6-2u,u+d,2u+2d—6) 


u2+dé=n 
6=1 (mod 2) 


= 2 D> (-1)%f(u+d) 
u2+dé=n 
6=1 (mod 2) 


(-1) f(u + d) 


T 
ed 


(-1)"-“" f(ut+d) 


I 
M 


u2+dé=n 
6=1 (mod 2) 


= 2-1)" So (-1)"f(utd). 
fa ae 


The right side of Liouville’s identity is 


S) F(d+6,u,d — 5) + {2T,(0) — T2(0)}n=e. 


u2+d6=n 
If u? + dé = n, then also (—u)? + dé =n, and the map 


is an involution! on the set of solutions of the equation u? + dé = n. Then 


I 


S" F(d+6,u,d — 6) S> F(d+6,-u,d 6) 


u2+dé=n u2+d6=n 


= — S> F(d+6,u,d-6), 
u2+dé=n 


1An involution on a set X is amap a: X — X such that a? is the identity map. 
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since F(x, y, z) is an odd function of y. Therefore, 


S" F(d+6,u,d—6) =0. 


u2+dé=n 
If n = é?, then 
2l-1 
CO = SFG) Paes) 
j=l 1<jS2e-1 
j=1 (mod 2) 
L e 
= So F(i-1,62-1I =-S>fO 
i=l j=l 
= —f(é) 
and a 
Doe 5 Fee 3.27) =o: 
j=—é41 
Therefore, 


2 >) C)*f(utd) = (-1)™{-fO}nee 


= {(-1)ef(O}n=e- 


This completes the proof. 


Exercises 


1. Let F(x, y, z) be a function that is odd in x and even in (y, z). Write 
out Liouville’s formula in the case n = 4, and confirm it directly using 
only the parity properties of F(a, y, z). 


2. Prove that for every positive integer n the diophantine equation 
wtuww=n 


has infinitely many solutions in integers u,v, w, but only finitely many 
solutions in integers with v > 1 and w > 1. 
13.2 Prime Numbers and Quadratic Forms 


A quadratic form is a homogeneous polynomial of degree two. The quadratic 
form Q(z, y,...,2) represents the integer n if there exist integers a,b,...,¢ 
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such that Q(a,b,...,c) =n. A binary quadratic form is a quadratic form in 
two variables. A ternary quadratic form is a quadratic form in three vari- 
ables. In this section we apply Theorem 13.2 to obtain classical theorems 
about the representation of prime numbers by the binary quadratic forms 
x? +y? and x? 4 2y?. 

We begin with some results about divisors. Recall that a positive integer 
d is called a divisor of the positive integer n if there exists an integer 6 
such that n = dé. The integer 6 is called the conjugate divisor of d. The 
divisor function o(n) is the sum of the divisors of n, that is, the arithmetic 
function defined by 

a(n) = S- d. 


d|n 


We denote by o*(n) the sum of the divisors of n whose conjugate divisors 
are odd. For example, o(10) = 1+24+5+10 = 17 and o*(10) = 2+10 = 12. 
If p is an odd prime, then o(p) = o*(p) = p+1. 


Lemma 13.1 Let n be an odd positive integer. Then o(n) is odd if and 
only if n is a square. 


Proof. Let 


n=|[p” 


p|n 


be the unique factorization of n as a product of odd prime numbers. The 
positive integer d divides n if and only if d can be written in the form 


where 


and so 


= [[@ +1) (mod 2) 


= 1 (mod 2) 
if and only if up is even for all p, that is, up = 2w, and 


2 


n= [[e” - Io” 


pin p|n 


is a square. This completes the proof. 
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Lemma 13.2 If n = 2*m, where k > 0 and m is odd, then o*(n) = 
2*a(m). If o*(n) is odd, then n is the square of an odd integer. 


Proof. Let d be a divisor of n. If the conjugate divisor 6 = n/d is odd, 
then 2* must divide d, and so d = 2d! for some integer d’. Then 


2m =n = db = 2*d’6, 
and d’ is a divisor of m. Conversely, if d’ is any divisor of m, then 2*d’ is 
a divisor of n whose conjugate divisor m/d’ is odd. Therefore, 
o*(n) =2* S- d' = 2*o(m). 
d’|m 


If o*(n) is odd, then k = 0 and n = m is odd. It follows that o*(n) = 
a(m) = a(n) is odd, and so n is a square by Lemma 13.1. This completes 
the proof. 


Lemma 13.3 For every positive integer n, 
o(n)=2 So (-1)"71o*(n =u?) + {(-1)" nae. 
l<u</n 


Proof. We apply Theorem 13.2 to the odd function f(y) = y. Ifn = @, 
the right side of the identity is 


(-1) Mf (Q) = (-1)"18 = (-1)" In. 


To obtain the left side of the identity, we recall the involution (13.1) on 
triples (u,d,6) such that u? + dé =n and 6 is odd, and obtain 


S> (-1)*u=0. 


utdé=n 
6=1 (mod 2) 


utdé=n utdé=n 
6=1 (mod 2) 6=1 (mod 2) 


= So (-1ytut+ SS (-1)"d 


u24d6=n u24+d6=n 
6=1 (mod 2) 6=1 (mod 2) 


= So (-1)"d 


u2+dé=n 
6=1 (mod 2) 


- ye y a 


u2<n n—u2=déd 


= SO (-1)*o*(n—- vu’). 
lul<vn 
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Therefore, 


This completes the proof. 


Theorem 13.3 (Fermat) An odd prime number p can be represented by 
the quadratic form x? + y? if and only ifp=1 (mod 4). 


Proof. Since every square is congruent to 0 or 1 modulo 4, it follows 
that a sum of two squares must be congruent to 0, 1, or 2 modulo 4, and 
so no integer congruent to 3 modulo 4 can be represented as the sum of 
two squares. 

Let p be an odd prime number. Then p is certainly not a square. By 
Lemma 13.3, 


a*(p) = 20*(p— 1) — 20*(p — 4) + 20* (p— 9) —---. 


Since o*(p) = p+ 1, we have 


= 0*(p— 17) — o* (p— 2”) + o* (p — 37) 


If p= 1 (mod 4), then (p+ 1)/2 is an odd integer, and so at least one 
of the terms on right side of this equation must be odd. Thus, there exists 
a positive integer b < \/n such that o*(p — 67) is odd. By Lemma 13.2, 
p — b? = a? for some odd integer a. This completes the proof. 


Theorem 13.4 If p is a prime number such that p= 1 (mod 4), then 
there exist unique positive integers a and b such that a is odd, b is even, 
and p= a? + b?. 

Proof. Let 

p= aj + bj =a} + by, 

where a; and ag are positive odd integers and 6; and bz are positive even 
integers. We must prove that a, = a2 and b; = bo. 

If ay < dg, then b; > by and there exist positive integers x and y such 
that 

ag =a, +22 


and 
bo = by = 2y. 
Then 
p = a+} 
= (a, + 2x)? + (br — 2y)? 
24 dayx + 42? 4 b7 Abyy 4 Ay? 
p+ 4a,x + 4x? — 4byy + 4y?, 


I 
ay 
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and so 
x(a, + x) = y(b; — y). 


Let (x,y) = d. Define the positive integers X and Y by « = dX and 
y = dY. Then 


X(a, +2) =Y(bi —y). 
Since (X,Y) = 1, it follows that there exists a positive integer r such that 
rY =a, +x=a,+dxX 


and 
rX = b) —y = 0, — aY. 


Then r? + d? > 2 and #74 y? > 2, and 
pat +b? =(r¥ —dXY + (7X +dVVP SP? + 0P)(07+Y"), 


which is impossible, since p is prime and not composite. Therefore, a1 = a2 
and b; = be, and the representation of a prime p=1 (mod 4) as a sum 
of two squares is essentially unique. 


Theorem 13.5 An odd prime number p can be represented by the quadratic 
form x? + 2y? if and only ifp=1 or3 (mod 8). 


Proof. Since every square is congruent to 0, 1, or 4 modulo 8, it follows 
that an odd integer n is of the form a? + 2b? only ifn =1 or 3 (mod 8). 

Let a be a positive integer, a < \/n. By Lemma 13.3, for every positive 
integer n we have 


o*(n)=2 So (-1)""1o*(n =u?) + {(-1)" nae. (13.2) 


l<u</n 
Let 1 <u < Wn. Applying Lemma 13.3 to n — u?, we have 
* 2) _ v—-1_x 2 2 n—u-1 2 
o(n-w)=2 SD (Fon (CD) (nw nese. 
1l<vu?<n-u? 


Inserting this into (13.2), we obtain 


a*(n) = 4 S- (=1)"*"6* (n— wv? — vy’) 


ujv>1 
u2ztu2<n 


fai > ea) oe $e a. 


l<u<J/n 
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Ifu Av and u2+v? =n, then v? + u? = n and the pairs (u,v) and (v,u) 
both appear in the first sum. Considering congruences modulo 8, we obtain 


f Den 


1)?*"@ * (n —u? — 0") 


u,v>1 

si? ha ce 

= 8 eu 1)**’o*(n — u? — v? )+4 50 a *(n — 2u’) 
l<u<v u>1 
(2awlcA 2u2<n 

= 4 Soo *(n —2u”) (mod 8). 
u>1 
Qu2<n 

Therefore, 
a(n) = 4 Soa *(n — Qu) + 2(-1)” S- {n — U7 }n—u2=02 
u>l u>l 
2u2<n ur <n 

(mod 8). 


+ {(-1)" t}rae 


Let p=3 (mod 8). The prime number p is not a square, and, by The- 
orem 13.3, p is also not the sum of two squares. Therefore, 


YP phpee = {p— Ww }p_ woe =0 


i= 


for all u, and so 
(mod 8). 


45° oa *(n — 2u?) 


u>l1 
Qu2<n 


=o"(p)=pt1=4 


Dividing this congruence by 4, we obtain 
1 
Pt" =1 (mod 2), 


S- o*(n—2u*) = A 


u>l 


and so o*(n — 2b”) is odd for some integer b. Then n — 2b? = a? for some 


odd number a, and n = a? + 2b? 
Let p=1 (mod 8). Then 


o*(p)=p+1=2 (mod 8). 


By Theorems 13.3 and 13.4, there exist unique positive integers a and 6 
such that p = a? + b?, where a is odd and b is even. This implies that 


S" {(p U Ve aj2 a = {p—a"}p-a2ay2 + {p b ie b2=a2 = OC +a =D, 


u>l 
u2<p 
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and so 
2 = o*(p) (mod 8) 
= 4 (p — 2u”) Peak {(p —u)}p-u2=02 (mod 8) 
u>l l<u2<p 
2u2 <p a 
= > es (p —2u”) —2p (mod 8) 
url 
2u2 <p 
= 4\° o (p —2u?) —2 (mod 8). 
url 
2u2 <p 
Therefore, 


and 


4 N° o*(p—2u?)—-2=2 (mod 8), 


uD. 
2u2<p 


S- o*(p—2u*)=1 (mod 2). 


uD. 
2u2<p 


It follows that o* (p—2b7) is odd for some positive integer b, and so p—2b? = 
a? for some odd integer a. This completes the proof. 


Exercises 


1. 


Prove that o*(n) = 1 if and only if n = 2* for some nonnegative 
integer k. 


Let d(n) denote the number of positive divisors of n. Prove that d(n) 
is odd if and only if n is a square. 


Prove that n is a sum of two squares if and only if 2n is a sum of two 
squares. Hint: Consider the identity 2(a? + y?) = (a+y)?+(a—-y)?. 
Let n = 2*m, where k > 0 and m is odd. Prove that n is a sum of 


two squares if and only if m is a sum of two squares. 


Verify the polynomial identity 


(x? + yf) (x3 + y3) = (aire — yrye)? + (z1y2 + 122)”. 


Deduce that if each of the integers n; and nz can be represented as 
a sum of two squares, then their product n,n is also a sum of two 
squares. 


Let k > 2 and let n,,...,nz be positive integers. Prove that if each 
integer n; is a sum of two squares, then the product nin2---nz is a 
sum of two squares. 
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6. For every prime p and positive integer n, let v,(m) denote the highest 
power of p that divides n. Prove that if v,(n) is even for every prime 
p=3 (mod 4), then n can be represented as a sum of two squares. 


7. Let a and b be relatively prime integers, and let p be an odd prime. 
Prove that if p divides a?+b?, then p=1 (mod 4). Hint: Show that 


(ab-')? =—1 (mod p), and so (+) = 1, where (3) is the Legen- 
dre symbol. Recall that (+) = 1ifand only ifp=1 (mod 4). 


8. Let p be a prime number, p= 3 (mod 4), and let a and b be integers. 
Prove that if p° exactly divides a? +0? (that is, p° is the highest power 
of p that divides a? + b?, then c is even. Hint: Let d= (a,6), and let 
p’ exactly divide d. Let a = dA and b = dB, and consider the highest 
power of p that divides A? + B?. 


9. Prove that if n can be represented as a sum of two squares, then v,(n) 
is even for every prime p= 3 (mod 4). 


13.3. A Ternary Form 


We begin with the ternary quadratic form 
Q(x,y,2) = a? + yz. 


A representation of n by the quadratic form Q(z, y, z) is an ordered triple 
of integers (x,y, z) such that Q(a, y, z) =n. We denote by R(n) the set of 
all representations of n by the quadratic form Q, that is, 


R(n) os {(x,y, 2) : Q(z, y, 2) n}. 


We introduce six bijections from the set R(n) to itself. The simplest are 
the involutions 


P(z,y,z) = (x,2,y), 

a(x, y, 2) a (—2,9; 2), 
and 

TE 2) a (z, —Y; —Z) 
Let 


a(z,y,2) = (2-4, 2a+y- 2,2). (13.3) 
If (x,y, z) € R(n), then 


Q(a(x, y, z)) a OG = 820-9 — 2,2) 
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= (z-2)?+(Qr+y-z)z 


S gS 2e7 a 4+ 222+ ye 2 


= a? + yz 


= n, 


and so a(x,y, z) € R(n). Moreover, 
a? (x,y, Z) _ alz - £, 22+ y— z, 2) = (x,y, 2), 


and so a is also an involution on the set R(n). 
Let 


B(z,y,2) =(a@+y,y,—2e —y +2). (13.4) 
If (x,y,z) € R(n), then 
Q(8(2,y,z)) = Qe+y,y,—-2e—yt z) 


= («+y)?+y(-2e—y +z) 
= g?+2%ryty? —2aey—y? + yz 


= ¢+yz 
a 
and so B(x, y,z) € R(n). 
Let 
VW(@sYs 2) = (@— ys y, 2a —y + 2). (13.5) 
If (x,y, z) € R(n), then 
Q(y(#,y,2)) = Qax—y,y, 2x —y+ 2) 


= («-y)?+y(Qr—-y+2z) 


= wg? -Qryt+y?+22y-—y* + yz 


= x? + yz 


= n, 


and so y(x,y,z) € R(n). Moreover, 


y8(2,y,2) = y(et+y,y,—22 —yt 2) 
= @4y—9,4,2¢ +9) 9s (22-94 2) 
= (2,y,2). 
Similarly, 


By (x,y,z) = (2, Y; 2). 


Therefore, 3,7: R(n) > R(n) are bijections with y = 671. 
Finally, we state the following simple lemma, which will be used in the 
proof of Liouville’s formula. 
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Lemma 13.4 Let S and S’ be finite sets, and let 0: S > S’ be a bijection 
with inverse 0-! : S’ + S. If G(s) is a function defined for all s € S, then 


S° G(s) = S> GW7'(s’)). 


sES s'ES! 


Proof. This follows instantly from the fact that 0~1(S’) = S. 


Exercises 


1. Prove that o8o = y and pBop=a. 
2. Prove that Go is an involution. 
3. Prove that 
B(x, y, z) = (z - NYY, % — 2nx a ny). 
4. Compute y" (a, y, 2). 


5. Consider the 3 x 3 matrix 


1 0 0 
A=|{0 0 § 
0 4 0 
Let v denote the column vector 
x 
v= y 
z 


Its transpose is v? = (2, y, z). Show that 
Q(x,y,2) =v" Av. 


6. Let Qi(z,y,z) = 2? + y? — 2%. Check that Q(z,y+ z,y — z) = 
Qi(a, y, z) and Q1 (x, (y + 2) /2, (y =. z)/2) = Q(z, y, z). 
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In this section we prove Theorem 13.1. 
For every positive integer n, we let S(n) be the set of all triples (u, d, 6) 
such that 
Q(u, d,6) = u* + db =n, 
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where u is an integer and d and 6 are positive integers. Then S(n) is a 
finite subset of R(n). Using this notation, we have 


De Ee Des 


u2+dé=n  (u,d,6)ES(n) 
Partition S(n) into three sets Si(n), S_i(n), and So(n) as follows: 


S1(n) = {(u,d,6) € S(n) : 2u+d—6> 1}, 


So(n) = {(u, d,6) € S(n) : 2u+d—6 = 0}, 
and 
S_i(n) = {(u,d,6) € S(n) : 2u+d—6<—-l}. 


Let a be the map on S(n) defined by (13.3). If (u,d,6) € S(n), then d and 
6 are positive integers. If (u,d,6) € Si(n), then 2u+d-—6 > 1, and so 


(u’, d’, 6") = a(u, d,6) = (6 — u,2u+d— 6,6) € S(n). 


Since 
Qu’ +d’ — 6’ =2(6-u) + (2ut+d—6)-d=d>1, 


it follows that a(u,d,6) € S1(n), and so a is an involution on Sj(n). More- 
over, 


6 -2u = 6-2(6-—u) =—(6-2u), 
wt+d = (6—u)+(2u+d—6)=u+d, 


and 


2u’ + 2d! — 6’ = 2(6 —u) + 2(2u + d—6) —6 = 2u4+ 2d-6. 


Let F(a, y,z) be a function that is odd in x and even in the pair (y, z). 
We define the function 


G(a,y, 2) = F(z —2x,"+ y, 2u + 2y — 2). 


If (u,d,6) € Sy(n) and a(u, d, 6) = (u’,d’, 6’), then 


G(u, d, 6) + G(u’, d’, 6”) 
= F(6—2u,u+d,2u+ 2d— 6) + F(s' — 2u’,u’ + d’,2u’ + 2d’ — 6") 
= F(6—2u,u+d,2u+2d—6)+ F(—(6—2u),u+d,2u+2d—6) 


=) 


% 
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since the function F(x, y, z) is odd in its first variable x. From Lemma 13.4 
with S = S’ = S;(n) and § = 8! =a, we obtain 


SN)  -F(6-2u,ut+d,2u+2d-6) = S> Glu, d, 4) 
(u,d,6)ES1(n) (u,d,6)ESi(n) 
= So Gu’, d’, 6’) 
(u,d,6)ES1(n) 


=- So. Giud,6) 
(u,d,6)ES1(n) 
= 0. 
Next we consider triples (u,d,6) € So(n). Since 
2u+d—6=0, 
it follows that 
ae é6—d 
ag 
and ? . 
é6—d d+6 
pee aR (*) igex (+) ~ 2, 
where d+é 
(os 
po 


Therefore, the set So(n) is nonempty only if n is a square. Moreover, the 
integers d and 6 are positive, and so 


1l<d=2@-—6< 2@-1. 
Conversely, if 1 <d < 2@—1, we set 6 = 2€—d and u= @—d. Then 


w+dé6 = (€-d)?+d(2é-d)=C¥=n, 
2ut+d—6 = 0, 
and 
(u,d,6) € So(n). 
It follows that if n = @? with @> 1, then 


So(n) = {(d — £,d,2€— d):1<d<2€-1} 


and 


2-1 
S> -F(6—2u,utd,2u+2d—6) = S> F(d,£,d) = Ty(n). 
(u,d,6)€So(n) d=1 
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To analyze the sum 


S- F(d+6,u,d—6), 
(u,d,6)ES(n) 


we construct a second partition of S(n). Define the three sets S{(n), S’.,(n), 
and S¢(n)(n) as follows: 


Si(n) = {(u,d,6) € S(n):2u—d+6>1}, 
S’,(n) = {(u,d,6) € S(n):2u-—d+6<-1}, 


and 
Sh(n) = {(u, d,6) € S(n) : 2u—d+ 6 = Of. 


We shall prove that 
S° F(6 —2u, utd, 2u+ 2d — 6) 
,d,6)ES_1(n) 
= S> F(d+6,u,d-6) 
(u,d,6) ES} (nm) 


SS F(d+6,u,d—6) 
(u,d,6)ES" ,(n) 


e 


I 


and 


S> F(d+6,u,d—6) = {To(n)}n-e- 
(u,d,6)ES5(n) 


Let @ be the map on S(n) defined by (13.4). If (u,d,6) € S_i(n), then 
2u+d—6<-1, and so —2u—d+6>1 and 


(u’, d’, 6’) = B(u, d,6) = (u+d,d,—-2u—d+6) € S(n). 


Moreover, 


Qu’ —d' +6’ =2(u+d)—d+(-2u—d+6)=6>1, 


and so 
6: S_1(n) > S{(n). 


Let 7 be the map on S(n) defined by (13.5). If (u’,d’, 6’) € S{(n), then 
2u’ —d'+ 6’ >1 and 


(u,d,6) = y(u’,d’, 6’) = (u' —d',d’,2u’ —d’ + &) € S(n). 


Moreover, 


2Qut+d—6 =2(u' —d')+d' — (2u'-d'+6)=-6' <-1 
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and so (u,d,6) € S_i(n). Therefore, the map 
7: S!(n) -+ S_4(n) 
is a bijection, and y = 3+. 
Applying Lemma 13.4, we obtain 
S- F(6 — 2u,u+d,2u + 2d—6) 
(u,d,6)ES_1(n) 
= > Gu, 4,6) 
(u,d,6)ES_1(n) 
= De G(y(u', d’, 6’)) 
(u’,d’,6’)EB(S_1(n)) 
= So Gu -d',d',2u' —d' +8’) 
(u’,d’,6')ESi (n) 
= S- F(d' + 6',u',d’ —6'). 
(u’,d’,6') ES! (n) 

Let w be the map on S(n) defined by w(u,d,6) = (—u,6,d). Then w is 
an involution since w = po. If (u,d,6) € S{(n), then 2u—d+6 > 1, and 
so —2u—6+d<-—1 and 

w(u, d, 6) = (—u, 6,d) € S’,(n). 
Similarly, if (u,d,6) € S“,(n), then 2u—d+6>1, and so -2u—6+d>1 
and 


Therefore, 
p+ Sy(n) > S14 (n) 
is a bijection with yt = ~. Let 
A(x, y, 2) > PYH2 29 — z). 
By Lemma 13.4, 
S> -F(d+6,u,d—6) S° A(u,d,6) 
(u,d,6) ES! (n) (u,d,6) ES} (nm) 
= S>  -H(w(u, 4, 6)) 
(u,d,d)ES" , (nm) 
= Ss A(-u,6,d) 
(u,d,d)ES" ,(n) 
= S> F(6+d,-u,-6- d) 
(u,d,d)ES" ,(n) 


= S > F(d+6,u,d+6), 
(u,d,6)ES" | (n) 


I 
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since the function F(x, y, z) is even in the pair of variables (y, z). 
If (wu, d,6) € So(n), then 


2u-d+6 = 0, 
=: 
— SE 
and , . 
d 
n=w+d5= (“*) +05 = (S*) = 
5 D 
where 
pats 
= ote 


Therefore, the set S4(n) is nonempty only if n is a square. Since the integers 
d and 6 are positive, it follows that 


1<d=20-—6< 2@-1. 
Conversely, if 1 <d < 2€—1, we set 6 = 2€-—dandu=d-—&. Then 


w+dé6 = (d—2£)*+d(2é-d)=C¥=n, 
2u-d+6 = 0, 


and 
(u,d,6) € Sh(n). 


It follows that if n = @? with @> 1, then 


Si(n) = {(d — 0,d,20—d):1<d< 2-1} 


and 
2£—-1 
S> F(d+6,u,d—6) = 5° F(2t,d-£,2d— 26) 
(u,d,6) ES) (n) d=1 
e-1 
= Do F(2¢,5,25) 
j=-f41 
Therefore, 


S° F(d+6,u,d—6) 
(u,d,6)ES(n) 
= 2 Se F(d+6,u,d— 6) + {To(n)}n=02 
(u,d,6)€S! (n) 
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= 2 So F(6-2u,u+d,2u+ 2d—6)+ {Th(n)}ne 
(u,d,6)ES_1(n) 

= 2 So F(6-2u,u+d,2u+ 2d —6) 

(u,d,6)ES_1(n) 

2 So F(6—2u,u+d,2u+2d — 6) + {To(n)}n=e 

(u,d,6)ES1(n) 

2 S° F(6-2u,u+d,2u+2d—6) 

(u,d,6)ES(n) 

— 2{T1(n) }nae2 + {T2(n)}naer- 


I 


This completes the proof of Theorem 13.1. 


13.5 Two Corollaries 


In this section we derive two additional identities that we use in the next 
chapter. 


Theorem 13.6 If F(x,y,z) is a function that is odd in each of the vari- 
ables x,y, and z, and if F(x,y,z) =0 for every even integer x, then 


S> F(6 - 2u,ut d,2u+ 2d — 6) = {To(O)}n=e, 
(u,d,6)ES(n) 
6=1 (mod 2) 


where 


To(0) = S° F(2j -1,6,2j -1). 


j=1 


Proof. Since the function F(z, y,z) is odd in the variable y, we have 
F(x,0,z) = 0 for all x and z, and 


S° F(d+6,u,d—6) 


(u,d,6)ES(n) 
= S° F(d+6,u,d—6) S > F(d+6,u,d—6) 
(w,d,6)ES(n) (u,d,d)€S(n) 
uw u<—-1 
= S° F(d+6,u,d—-6) S> F(d+6,-u,d-6) 
(u,d,6)ES(n) (u,d,6)ES(n) 
u>1 u>1 
= SS) F(d+6u,d-6)- So F(d+6,u,d—-8) 
(u,d,6)ES(n) (u,d,6)ES(n) 
u>i1 u>1 


I 
S 
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Since F(x, y, z) = 0 for all even integers x, we have 


S° F(6—2u, utd, 2ut+2d—6)= S° F(6—2u, utd, 2u+2d-6). 
(u,d,6)ES(n) eee Ge 
If n = é?, then 
2-1 
Ti(@) = >> FG.45) = >) FG -1,4,29 -1) 
j=l j=l 
and 
L-1 
RO= 5) Feh7,27)=0. 
j=—f+1 


The result follows immediately from Theorem 13.1. 


Theorem 13.7 Let f(x,y) be a function that is odd in each of the variables 
x and y. For every positive integer n, 


So (EY? F(6 — 2u,u+ d) = {To(O) bane, 


u2+d6=n 
6=1 (mod 2) 


where 


Proof. We define the function F(x, y, z) as follows: 


0 if x or z is even, 


Flay 2)'= { (—1)¥+* f(x,y) if x and z are odd. 


Then F(z, y,z) is a function that is odd in each of the variables x, y, and 
z, and F(x,y,z) = 0 for every even integer x. By Theorem 13.6, we have 


s F(6 —2u,u+d,2u+ 2d —6) 


u2+d6=n 
6=1 (mod 2) 


= SS (-1)@-Y/? F(6 — Qu, u + d) 


u2+dé=n 
6=1 (mod 2) 


= {To }n=e: 


where 
L 
T(f) = > F(2j-1,£,2j-1) 
j=1 
7 
= UCD FQ - 1,9. 


j=l 
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This completes the proof. 


13.6 Notes 


Liouville’s papers contain the statements of many theorems, but no proofs. 
Dickson’s History of the Theory of Numbers [25], Volume II, Chapter XI, 
“Liouville’s series of eighteen articles,” contains a detailed summary of 
Liouville’s assertions and references to papers by other mathematicians 
who have provided proofs of Liouville’s results. 

Uspensky and Heaslet [145] and Venkov [149] present careful accounts of 
Liouville’s method and proofs of many of his results. 
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Sums of an Even Number of Squares 


The problem of the representation of an integer n as the sum of a 
given number k of integral squares is one of the most celebrated 
in the theory of numbers.... Almost every arithmetician of note 
since Fermat has contributed to the solution of the problem, and 
it has its puzzles for us still. 


G. H. Hardy [52, p. 132] 


14.1 Summary of Results 


For every positive integer s and nonnegative integer n, we let R,(n) denote 
the number of ordered s-tuples of integers (#1,...,2,) such that 


N= ape a. 
The integers x; can be positive, negative, or 0. For every s > 1 we have 
R,(0) = 1, 


since 0 = 0?+---+0? is the unique representation of 0 as a sum of squares. 

We shall apply Liouville’s identities to obtain explicit formulae for the 
number of representations of a positive integer as the sum of s squares, 
where s = 2,4,6,8, and 10. Representing an integer n as the sum of s 
squares is a problem in additive number theory, but the solution, for even 
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values of s, always involves a sum over the divisors of n, a fundamental 
topic in multiplicative number theory. 

In this chapter, d and 6 always denote positive integers, and }~> alts and 
> ,-as denote the sum over the positive divisors of n. 

We write the positive integer n in the form n = 2°m, where a > 0 and 
m is odd. We shall prove the following formulae: 


Riya 4y eye we, 


d|m 
Ran) = { oe a 
Re(n) = 4 (ee - (yer?) > (—1)6-D/2g2, 
m=d5 
Rg(n) = { 16 Xan e : nas odd, 
(16/7)(8°** — 15) alm @° if n is even, 
Rio(n) = : (16°41 + (1-2) > (-1)@-Y/2¢4 
m=d6 


14.2 A Recursion Formula 


Our proofs depend on the following recursion formula for Rs(n). 


Theorem 14.1 For all positive integers s and n, 
S> (n—-(s84+ 1)u*) R,(n- u*) = 0. (14.1) 
lulsV/n 
Proof. If 
maapte bast o41, 


then #2,, <n and so 


|ve44| < Vn. 


For j = 1,..., Rs41(n), let 
s+l1 


fas) 2 
He y 2 j 
t=1 


denote the Rs4i1() representations of n as a sum of s + 1 squares. For 
i=1,...,8, we define the map 7; on the set of (s + 1)-tuples by 


Ti (21, Parr > Uj-1,%j, Vi+1, iy ance , 25,2541) = (21, S233 > Uj—-1,Us41,Vi4+1, oe aya): 
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This is an involution on the set of the R,i1(n) representations of n as a 
sum of s+ 1 squares, and so 


Rs+1(n) Rs41(n) 
y Ceti = y D4 fori =1 s 
jal j=l 


Summing over all representations of n, we obtain 
Rs41(m) s+1 


nRs41(n) = » ds 


eave. 
= 2 
a ba ba, 
(=. 521 


Rs41(n) 


= (stl 2 eH 


= (s+1) s wRs(n—u?), 
|ul<va 


since for ee) integer u with |u| < \/n there are R,(n—u?) representations 
n= ae x; ; with v541,; = u. This also implies that 


Reai(n) = oo R,(n—u*). 


lulsvn 
Then 
NRsi(n) =n oe R,(n— vu”), 
jul<vn 
and 


S- (n —(s+1)u?) R,(n —u?) =0. 
lulsvn 
This completes the proof. 


Theorem 14.2 Let ®(n) be a function defined for all nonnegative integers 
n such that 


and 


lul<Vn 
forn >1. Then 


for alln > 0. 
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Proof. This follows immediately from Theorem 14.1. 
The recursion formula (14.1) enables us to compute R,(n) for all positive 
integers s and n. We have 


nR,(n) = — S- (n — (s+ 1)u?) Re(n — u’) 
1<|ul<vn 
= 2 b> ((s + 1)u* —n) R,(n— uv’), 
L<u< vn 


and so 


R.(n)=2 >> a = 1) R,(n— v2). (14.2) 


1<u<Vn 


For example, for s = 3 we have 


R3(1) = 2(42-1) R3(1-12) = 6, 
R3(2) = 2(4 -1) R3(2- 1?) 1 4 
R3(3) = 2(4°-1) R(3-12) = 8, 
R,(4) = 2((42-1) Re(4—12) + (42 -1) Ra(4 - 2) 6, 
R(5) = 2((42 -1) Ra(5—12) + (42 -1) Ra(5 — 2”) Bae « 
R(6) = 2((4"°—-1) R,(6-17)+ (4* -1) R3(6-27)) = 24 , 
R(7) = 2((4°-1) Ra(7-12)+ (42 -1) Ra(7 - 2”) 0, 
R(8) = 2((4¢-1) Ra(8—-12)+ (42-1) R,(8-22)) = 12 
Exercises 


1. Prove that R,(n) < R,41(n) for all positive integers s and n. 


2. Use induction on s to prove (without using Theorem 14.1) that R,(n) 
is even for all positive integers s and n. 


3. Use the recursion formula (14.2) to compute R2(n) and R4(n) for 
n<sg. 


4. For positive integers k and s, let Rx,s(n) denote the number of s- 
tuples of integers such that 


get... toh =n, 
Then Rs(n) = Ro,s(n) and Rox,5(0) = 1. Prove that 
S> (n= (8+ 1u?*) Rox,s(n — u?*) = 0 
Jul<nt/2k 


for every positive integer n. 
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5. Let & and s be positive integers. Prove that 


Rox,s(1) = 25. 


6. Let k and s be positive integers, and let 0 <n < 4*. Prove that 


Rox,s(n) = 2” a 


7. Let s > 3. Show that R3,,(n?) = oo for every integer n. 


8. For positive integers k and s, let r;,,,(n) denote the number of s-tuples 
of nonnegative integers such that 


Prove that rz,5(0) = 1 and 


S- (n — (s+ 1)u*) ri,5(n — u*) =0 


O<u<nl/k 


for every positive integer n. 


14.3 Sums of Two Squares 


Recall that S(n) is the set of all triples (u,d,6) of integers with d,6 > 1 
and u? + dé = n. 

If ky and kz are odd integers, then the function f(x,y) = x*y*? is odd 
in each of the variables x and y. Applying Theorem 13.7, we obtain 


So (-1)@-P2(6 — 2u)* (d+ uy 


u24dé=n 
5=1 (mod 2) 


L 
= Abe Oy =) (14.3) 


n= 


We shall use this identity for various values of k, and kz. We can simplify the 
sum on the left by noticing that (u,d,6) € S(n) if and only if (—u,d,6) € 
S(n). This implies that if & is an odd integer and g(d,6) is any function, 
then 

S> uk g(d,6) = 0. (14.4) 


u2+d6=n 
6=1 (mod 2) 
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Since (u,d,6) € S(n) if and only if (u,6,d) € S(n), it also follows that if 
e(d, 6) = e(6,d), then 


S> e(d,6)(d— d)h(u) =0 (14.5) 


u2+d6=n 


for any function h(u). 

In this section we shall obtain a formula for the number of representations 
of an integer as the sum of two squares. By Theorem 14.2, it suffices to 
construct a function ®(n) such that 6(0) = 1 and 


S- (n — 327) ®(n — 2”) =0 
|alSvn 
for every positive integer n. 


Theorem 14.3 


R2(n) =4)(-1) OV? =4 So 1- Sool 
dln 


d\n d|n 
d=1 (mod 4) d=3 (mod 4) 


Proof. The function f(x,y) = xy is odd in each of the variables x and 
y. The left side of identity (14.3) is 


So (DEY F(6 - Qu, d+ u) 


u24+d6=n 
6=1 (mod 2) 


= S- (—1)&-)/2(6 — 2u)(d + u) 


2 So (-D)&- YP (db — 2u? + 6u — 2du) 


u2+dé=n 
6=1 (mod 2) 


= SS (-1)@- YP (dé — 2u?), 


u24+dé=n 
6=1 (mod 2) 


by (14.4) with k = 1. 
If n = ¢?, then (by Exercise 1) the right side of the identity (14.3) is 


L 


T(€) = €S0(-1)°9(25 -1) 


j=l 
= 2 


= n. 
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Therefore, 


S> (HI) OVP (dd — 2u?) = {To(8)}n=e- 


5=1 (mod 2) 


If d and 6 are positive integers and 


n=u? + d6, 
then 
Jul << /n 
and 
dé — 2u? =n — 3u?. 
Therefore, 
So (-D& VP (ds -2u?) = So (-D&E PP (n= 3u*) 

u2+d6=n u2+d6=n 

6=1 (mod 2) 6=1 (mod 2) 
= Ws (n — 3u?) os yee, 
Jju|<n 6|(n—u?) 


6=1 (mod 2) 


Define the function 6(n) by ®(0) = 1 and, for every positive integer n, 


®(n)=4 SO (-1)@VP. 


5|n 
5=1 (mod 2) 


Then 
SS] (n-3u?)®(n — uw?) = {4} nae. 
lul<V/n 
If n is not a square, then 
SS (n-3w)®(n—w?) = S° (n—3u?)O(n— u?) = {An} nae = 0. 
lul<Vn lul<V/n 
If n = @ is a square, then 
S- (n—3u7)®(n—u?) = ‘> (n — 3u7)®(n — u?) 
lulsVn lul</n 
+(n — 3m*)®(0) + (n — 3(—m)?)®(0) 
= {4n}y=02 —2n—2n 
= 0. 
Therefore, 
Ro(n) = O(n) 


for all positive integers n. This completes the proof. 
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Exercises 


1. Prove that for every positive integer £, 
é 
Sey ae. 
j=l 
2. Let p be a prime number such that p=1 (mod 4). Prove that 
Ro(p*) = 4(k +1). 
3. Let p be a prime number such that p=3 (mod 4). Prove that 
ky _ J 4 if k is even, 
Ra(p j= 0 if kis odd. 


4. Define the divisor functions 


dy (n) = S- 1 


d\n 
d=1 (mod 4) 


d\n 
d=3 (mod 4) 


and 


Prove that d(n) > d3(n) for every positive integer n. 


5. Let p bea prime number, p=3 (mod 4). Prove that if n = p?*-!m, 
where (p,m) = 1, then 


and 


Deduce that n cannot be written as the sum of two squares. 


6. An arithmetic function f(n) is called multiplicative if 


f(nin2) = f(ni)f (m2) 


for all positive integers n; and ng such that (n1,n2) = 1. Define the 
function y(n) by 


0 if n is even, 
x(n)=4 1 ifn=1 (mod 4), 
-1 ifn=3 (mod 4). 
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Prove that x(n) is multiplicative. 
Prove that 


Ro(n) = 5° x(n). 
dl 


Prove that R2(n) is multiplicative. 


Hint: If (ni, n2) = 1 and dis a divisor of nine, then there exist unique 
divisors d, of n; and dz of ng such that d= djdo. 


7. The divisor function counts the number of positive divisors of n, that 


is, 
d(n) = S- 1. 
dln 


Prove that d(n) is a multiplicative function, and that 
R2(n) < 4d(n) 
for all positive integers n. 


Hint: Since Ro(n) and d(n) are both multiplicative functions, it suf- 
fices to to check the inequality for prime powers. 


8. Prove that lim inf, ... Re(n) = 0. 


9. Prove that lim sup,,_,,, Re(n) = oo. 


14.4 Sums of Four Squares 


In this section we prove Jacobi’s formula for the number of representations 
of an integer as the sum of four squares. 


Theorem 14.4 (Jacobi) For every positive integer n, 


Ra(n)=85 od ifn is odd, 
dl 


and 
R4a(n) = 24 >» d if n is even. 


Proof. By Theorem 13.1, if F(x, y,z) is a function of integer variables 
x,y,z that is odd in x and even in the pair (y, z), then 


2 So F(6-2u,u+d,2u+2d-5)- S> F(d+6,u,d—6) 


u2+db=n u2+dé6=n 
2e@-1 e-1 

= (2) FG,45)- >> F(26,3,25) 
j=l j=-441 


n=l? 
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The function (—1)’F (a, y,z) is also odd in x and even in the pair (y, z). 
Applying Theorem 13.1 to the function (—1)*F (2, y, z), we obtain 


2 So (-1)'F(6—2u,u+d,2u + 2d — 6) 


u2+dé=n 
— bp (-1)*° F(d+ 6,u,d—6) 
u2+d6=n 
2¢-1 é-1 
= 420-1 FG4I- DD F(265,23) 
j=l ja—l4+1 dis 
Adding these identities gives 
4 S> F(6-2u,u+d,2u+ 2d — 6) 
u2+d6=n 
6=0 (mod 2) 
-2 So F(d+6,u,d-6) 
u2+d6=n 
d=6 (mod 2) 
é-1 
= (4 So FG,69)-2 So F(2L,5,23) . (146) 
1<j<2e-1 ja—l+1 
j=0 (mod 2) n=l2 
Subtracting these identities gives 
4 SS) F(6-2u,utd,2u+ 2d — 6) 
u2+4+dé=n 
6=1 (mod 2) 
-2 So F(d+6,u,d—-6) 
u2+d6=n 
d=-—6 (mod 2) 
Sidd) «SS RUGS) ' (14.7) 
1<j<2e-1 
j=i (mod 2) 2 


The function 


Cee = 0 if x or z is odd, 

2) ) (-1)(@+2)/2F(a2,y,z) if @ and z are even 

is also odd in the variable x and even in the pair of variables y, z. Applying 
identity (14.6) to the function G(z, y, z), we obtain 


4 SS” (-1)4F(6 -2u,u+t d,2u+ 2d — 6) 


u24+d6=n 
6=0 (mod 2) 
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-2 S° (-1)4F(d+6,u,d-6) 


u24+d6=n 
d=6 (mod 2) 


e-1 
= 44 So FG,8j)-2 S> (-1)9F(26,5,25) > (14.8) 
1<j<2e-1 j=—l+1 
j=0 (mod 2) n=l? 


Subtracting (14.7) from (14.8) and dividing by 2, we obtain the important 
identity 


1 
2 SS (aa) (F (5 — 2u,u+d,2u-+2d — 4) — 5F(d+6,u,d ‘)) 
uz+d6=n 
2k-1 ; l-1 ; 
= 925° (-1) 7FG,4A—- >> (1) F(26,5,29) > (149) 
j=l j=—@41 nal? 
where 
1 if d and 6 are even, 
ce a if d or 6 is odd. 


The formula for R,(n) follows immediately from applying this identity to 
the function 

F(x,y,2) = xy”. 
We obtain on the left side 


2 Se ~ 2u)(u +d)? 5(a-+ 8)u2) 

uz+d6=n 

= 2 Soe 6) (26+ 2d5u + 15u2 Qu ie 2d7u 
aa D 2 


I 
i Mo 


e(d, 6) (a(n u’) + sou stu?) (by (14.4)) 


u2+d6=n 

= 2 So e(d,d)d(n—5u?)- So e(d,6)(d—d)w? 
u2+d6=n u?+d6=n 

= So (n—-5u’)2 S° e(d,d)d (by (14.5)). 
ur<n n—u2=d6 


If n = £?, the right side of (14.9) is 


2f-1 


20 N° (-1) I 1; 20 wre 1)+55 es 4 - 1- —Ij 
j=l j=—t41 
a. Bi Vie 
2 
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and so 


yi n—5u7)8 S- e(d,d)d = {8n},=02. 


ur<n n—-u2=dé 


Define ®(0) = 1 and 
n)=8 S> e(d,d)d 
n=dé 


for n > 1. If n is not a square, then 


ur<n ur<n 


If n is a square and n = @?, then 


S> (n= 5u?)®(n) 


= So (n-5u?)®(n) + > (n-5u?)8(n) 
uz<n u=te 

a S- (n — 5u?)®(n) — 8n 

= 0. 


Therefore, 


n)=8 S° e(d,d)d 


n=d6 


for all positive integers n. 
If n is odd and n = dé, then e(d,6) = 1 and 


(n)=85 od. 
dln 


If n is even, then n = 2°m, where a > 1 and m is odd. Every divisor of 
n can be written uniquely in the form 2°d, where 0 < b < a and m = dé. 
Then 


Rin) = 8 >> 5 2e(2 (2%d,2-%6)2 
m=dé b=0 
= 8) > e(d,2%5)d+8 S> e(2%d,5)2°d 
m=dé m=dé 
a-1 
+8 S> > e(2'd,2*->6)2°d 
m=dé b=1 


a-1 
= 8) d+8 5° 2%d-8 5) Saha 


m=dé m=déb m=dé b=1 


14.4 Sums of Four Squares 


8 5) d+8 S$) 2%d-8(2°-2) So d 


m=dé m=déb m=déb 
24 Sod 

m=dé 

4 a 


d\n 
d=1 (mod 2) 


This completes the proof. 


Exercises 
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1. Prove that R4(2") = 24 for all k > 1. Find all representations of 2” 
as a sum of four squares. 


2. Prove that 


for alle > 0. 


3. Compute R,(p") for all odd primes p and k > 1. 


4. Prove that 


5. Prove that 


for n > 2. 


R4(n) < 24nlogn 


6. Prove that for every positive integer @, 


and so 


and so 


Sn = |), 


j=l 
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14.5 Sums of Six Squares 


In this section we obtain an explicit formula for Rg(n). The idea is to apply 
identity (14.3) to the monomials x?y and zy?, and to manipulate the results 
so that we can find a function ®(n) that satisfies the recursion formula 


D> (n — 7x?) ®(n — 2”) = 0. 
|al<V/n 
Theorem 14.5 Let n be a positive integer, 
n= 2%m, 
where a > 0 and m is odd. Then 
Re(n) = 4 (4844 — (1-9?) So (-yVP@. 
m=dé 


As an example, we shall describe the representations of 5 as a sum of 
six squares. There are 2°(2) = 192 representations as a sum of five terms 
(+1)?. There are 27($)(?) = 120 representations as a sum of (+1)? and 
(+2)?. Thus, there are 312 representations of 5 as a sum of six squares. 

We can also compute this number by applying Theorem 14.5 with a = 0 


and m = 5. Then 
Ro(5) =4 (4 = (Ie?) (5241) =4-3-26 = 312. 


Proof. The function f(x,y) = x°y is odd in each of the variables x and 
y, and so we can apply (14.3) with k; = 3 and kp = 1. The left side of this 
identity is 


So (DEPP (6 = 2u)?(u + d) 


u2+dé=n 
6=1 (mod 2) 


= So (-1) OP? (ub? — 6u?6? + 12u36 — 8u4 + dd? — 6ud6? 


u2+dé=n 
6=1 (mod 2) 


+ 12u*dé — 8u°d) 


= S> (-1)@-Y/? (83 — 6u?6? + 12u7d6 — 8u*) 


u2+dé=n 
6=1 (mod 2) 


So (-DE-VP (6 (n = Tu?) + 4u?(3n — 5u?)). 


u2+dé=n 
6=1 (mod 2) 


If n = ¢?, then (by Exercise 3) the right side of the identity is 


I 


£ 
To(€) = (-1) MS 0(-1)** (2k — 1) 
k=1 
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(—1)* 14(-1)?-1 (48 — 32) 


l 


= 44-3? 
An? — 3n. 
Therefore, 
So (-1)@ P96? (n — Tu?) + 40? (8n — 5u?)) = {4n? - 3n} poe. 
u24d6=n 


5=1 (mod 2) 


(14.10) 
Next we apply (14.3) to the function f(a, y) = ry?. The left side of the 
identity is 


So (DEPP (6 — 2u)(u+ a)? 


u2+dé=n 
6=1 (mod 2) 


= S (-DE-PP (u35 + 3u?dd + 3ud?6 + 436 — 2u4 — ud 


u2+d6=n 
6=1 (mod 2) 


— 6u7d? — 2ud?) 
= SS (-)@-Y (a6 — 6u?d? + 3u?dé — 2u") 


u24+dé=n 
6=1 (mod 2) 


= Ss SO YR@ a= 7) +7 Bn =—5u7)). 


u24+d6=n 
6=1 (mod 2) 


If n = @?, then (by Exercise 1) the right side of the identity is 


e 
Tot) = (—1) 18 S71)" 12k — 1) 
7 P k=1 


Multiplying by 4, we obtain 
So (-1)&-Y?? (4a? (n—7u?) +40? (8n—5u*)) = {4n7}aae. (14.11) 


u24+d6=n 
6=1 (mod 2) 


Subtracting equation (14.10) from equation (14.11), we obtain 
So (-DEPP (n= Tu?) (4d? — 6?) 


u2+dé=n 
6=1 (mod 2) 


= So(r-m) So (-1)@-Y/ (4d? — 6?) 


Jul<n d6=n—u2 


6=1 (mod 2) 
=> {8n} n= é 
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Let ®(0) = 1. For every positive integer n, define 
G(n)=4 SY) (-1)@-D/?(4¢? — 6?). 
6=1 eee 2) 


If n is not a square, then 


by (n — Tu?) ®(n — u?) = 5 (n — Tu?) ®(n — wu?) = 0. 


Jul<n Jul<n 


If n = @ is a square, then 


S- (n — Tu”) ®(n — u?) 


Jul<n 
= SS (n — Tu?) ®(n — u?) + (n — 70?) 6(0) + (n — 7(—£)) &(0) 
|ul<n 
= 12n—12n 
= 0. 
Therefore, 


Re(n)=O(n)=4 SS (-1)-Y/(4d? — 67). 
6=1 een 2) 
We rewrite this equation as follows. Let n = 2m, where a > 0 and m is 


odd. Then 6 is an odd divisor of n if and only if there exists a divisor d, of 
m such that d = 2%d, and m = d;6. Therefore, 


oS (<8 a4¢? = 4 So (- y— 1) )/2 (2° dy Ne 


d,6=m 


SS ea. 


dy6=m 


dé=n 
5=1 (mod 2) 


By Exercise 4, if m is odd and d,é = m, then 
Epi ene ar = epee 


It follows that 


S- (-1)@-D/2g2 = s. (—1)6-0/262 


dy6=m 


= SS Ca ea 


dé=m 


= (See S- (—1)8-D/2@2, 


db=m 


d6=n 
5=1 (mod 2) 
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Therefore, 
Ro(n) = ®(n) = 4 (4244 — (1D?) SX (1) 92. 
db=m 
This completes the proof. 
Theorem 14.6 For all positive integers n, 
3n? 


“2” < Re(n) < 40n?. 


Proof. Let n = 2m, where a > 0 and m is odd. The infinite series 
¢(2) = OP. k-? converges, and ¢(2) < 2 by Exercise 5. Then 
(hee 


Syed = m S- 7 


dé=m db=m 


IA 
S, 
M 

Ble 


db=m 
2 — 1 
ae eee 
k=1 
< 2m? 
and 
geri ee Epon < 4. 4e +1 
< 5(2°) 
Therefore, 
Re(n) = 4 (42 = (DP) SO (epee 
db=m 
< 4-5 (2%)? 2m? 
40n?. 
This gives the upper bound. 
To obtain a lower bound, we have 
6—-1)/2 72 2 (-1) OV? 
Tyenre = my CO 
dé=m db=m 


IV 
S, 
t 

bg 
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by Exercise 6. Also, 


Therefore, 


Re(n) 


geri _ nets 


Fe co 
S(O 


IVIV 


4 ae = (1-2) > (—1)@-D/2@2 


dé=m 
2 
m 
Pe 
30 
3n? 
5 


This completes the proof. 


Exercises 


1. Find all representations of 6 as a sum of 6 squares. 


2. Find all representations of 10 as a sum of 6 squares. 


3. Prove that for every positive integer m, 


L 
S(-1)!4 (27 — 1) = (408 — 30). 


j=1 


4. Prove that if m is odd and dé = m, then 


5. Prove that 


(Si eae = (ee: 


Hint: k-? < ie, a” *de for k. > 2. 


6. Prove that 


Hint: 4(2k +.1)~? < k7-?. 


7. Use the fact that ¢(2) = 77/6 to prove that 


Sek 2 


1 T 


Qk+1j2 24 


1=0.23.... 
k=1 
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14.6 Sums of Eight Squares 


Theorem 14.7 Let n be a positive integer. If n is odd, then 

n) =165—d¥. 

d\n 

If n is even and n = 2%, where a > 1 and m is odd, then 

16(8¢++ — 15 

ralr) = WOM) 
d|m 
Proof. We shall apply Liouville’s identity (Theorem 13.1) to the three 

polynomials (—1)¥ry*, (—1)¥xy3(2y — z), and (—1)¥zy?. 


Inserting (—1)¥xy* into Liouville’s identity, we find that the first term 
on the left is 


2 So (-1)"*4(6— 2u)(ut a4 

u?+d6=n 

= 2 S> (14 (d46 — 8u2a? + u46 — 8utd + 6u7a?5) 
u?+d6=n 

= 2 SP (-1"*4 (a(n - 9u?) + w*(5 — 14d) + 6nu?d) . 
u24+d6=n 


The second term on the left side of the identity is 


So (-I"(d+ 6ut=2 S¢ 


u2+db=n u2+d6=n 
If n = é?, then 
2l-1 
27; (€) = (-1)'26e* S> j = (-1)* (40% — 20°) 
j=l 


by Exercise 2, and 


f-1 f-1 
T(0) = 2 YS (-1)7j4 = 45 °(-1)77* 
j=—f4+1 j=l 


= (-1)'"! (2° — 4¢* + 207), 
and so the right side of Liouville’s identity is 


27, (€) — Ta(£) = (—1)£ (40° — 42? + 207) = (—1)"(4n? — 4n? + 2n). 
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Dividing by 2, we obtain 


So (-)*t4a(n-9u?) + S2 (-1)"ut ((-1)4(6 — 14d) - d) 
u2+db=n u2+dé=n 
+6n So (-1)"*4du? = {(-1)"(2n? — 2n? + n)}n=e2(14.12) 
uz+d6=n 
Next we consider the polynomial (—1)¥xy3(2y—z). The first term on the 
left side of Liouville’s formula is 


2 SY) (-1)"14(6 - 2u)(ut d)%6 
(u,d,6)ES(n) 
= 2 So (-1)*t4 (3d6?u? + d36? — 26u*4 — 6d?6u?) 

(u,d,6)ES(n) 

= 12 S- (—1)"*+4(36u?(n — u?) + d(n — u?)? 
(u,d,6)ES(n) 

— 26u* — 6du?(n — u?)) 

2  S>  (-1)"*4 (nu?(36 — 8d) + u4(7d — 56) + 7d). 
(u,d,6)ES(n) 


The second term on the left is 


I 


S> (-1)"(d+ 6)u3(2u — d+ 6) 2 So (-1)*(d+ d)u4 


u2+dd=n 24d6=n 
= 4 3S (-1)"dut. 
u2+dé=n 
If n = é?, then 
2-1 
2T1(€) = 25) (-1)j@ (2-35) 
j=l 
2-1 2-1 
= (-1)'40* N° j- (-1)'28 SS 9? 
j=l j=l 
— (-1)"2(4n3 — n?) 
7 3 
and 
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Therefore, 
2 S° (-1)"44 (nu?(36 — 8d) + u4(7d — 56) + 7d) 
uz+d6=n 
—4 Se 1)“du4 
u2+db=n 


{= eee “ey 


or, equivalently, 


3 S> (-1)"ut ((-1)4(7d — 56) — 2d) 


u2+db=n 
+3n S> (-1)"*4u?(36 —8d)+3n? So (-1)"*4d 
u2+d6=n u2+dé=n 
= {(-1)"(4n? — n*)} a (14.13) 
For every positive integer n we have 

S> (-1)"u! ((-1)4(6 - 14a) - a) 

uz+db=n 

+3 S° (-1)"u4 ((-1)4(7d — 56) — 2d) 
u2+db=n 
= 75) (-1)"ut So ((-1)4(d— 26) - d) 
ur<n n—u2=d6 

= 0 


by Exercise 3. Adding equations (14.12) and (14.13), we obtain 


S> (-ny"t4a3(n— 9u?)+9n  S> (-1)"*4u?(5 — 2d) 
u2+db=n u2+d6=n 
+ 3n? So (-1)"44d = {(-1)"(6n3 — 3n? + n)}paee-(14-14) 
u2+dé=n 


Finally, we consider the polynomial (—1)¥ay?. The left side of Liouville’s 


identity is 


2 So (-1)"t4(6-2ulutdy?—- S2 (-1)*(d+ dw? 


u2+d6=n u2+dé=n 
= 2 SO (-1)"*4(u?(6—5d)+nd)-2 S> (-1)"du?. 
uz+d6=n u2+d6=n 
If n = é?, then 


27, (£) — To(£) = (—1)*(40* — 207) = (—1)"(4n? — 2n). 
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Multiplying by 3n/2, we obtain 


3n SY) (-1)*u? ((-1)4(6 - 5d) — d) +3n? SO (-1)"*4d 


u?+d6=n u2+d6=n 

= 9n So (-1)"*4w? (62d) +3n? S* (-1)"44d 
u2+d6=n u2+d6=n 

= {(-1)"(6n° - 3n?)}uce, (14.15) 


since 


S° ((-1)%(6- 5d) -d) =3 SO (-1)4(6 - 2d) 


n—-u2=dé n—-u2=déd 


by Exercise 3. Subtracting (14.15) from (14.14), we obtain 


S> (-1)"*4a8 (n -— 9u?) = {(-1)’n} nae. 
u2+d6=n 
We define the function ®(n) as follows: 
6(0) =1 


and 


®(n) = 16(-1)” §_(-1)%a° 


d\n 


for every positive integer n. If n is not a square, then 


+2 (n—9u?)®(n-—u?) = S- (n — 9u?)®(n — u?) 


u2<n ur<n 
= 165 (n-9w)(-1I)"™" SS (14a? 
ur<n n—u2=d6 
= 16(-1)” S> (n-9u?)(-1)* $5 (-1)4d 
ur<n n=dé 
= 0. 
If n = é?, then 
.> (n — 9u?)®(n — u?) 
ur<n 


= Se (n — 9u?)®(n — u?) + > (n — 9u?)®(n — u?) 


ur<n u=te 


= 16(-1)" S) (n—9u?)(-1)"_ $0 (-1)4d* — 16n 


ur<n n—-u2=dé 
= 16(—1)"{(—1)"n}ace —16n 
= 0. 
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The recursion formula (14.2) implies that 
Re(n) = &(n). 


We can rewrite the expression for Rg(n) as follows. Let n = 2%m, where 
a > 0 and m is odd. The odd divisors of n are precisely the divisors of m. 
The even divisors of n are the numbers of the form 2°d, where 1 < b < a 
and d is a divisor of m. Then 


G(n) = 16(-1)" 5) (-1)4@ 


n=dé 
= 16(-1)” 2 Svar - Soa 
b=1 d|m d|m 


= 16(—1)” (o#-1) ne 
b=1 d\m 

of Betas i 

16(—1) ( = Ee 


I 


This completes the proof. 


Exercises 


1. Prove that for every positive integer n, 
128¢(3 
16n® < Rg(n) < (ae ) n>, 


where ¢(3) = 072, k-?. 


2. Prove that for every positive integer 2, 


3. Prove that 


= (144 - 26) — d) =0 


n=dé 
for every positive integer n. 


14.7 Sums of Ten Squares 


We shall determine the number of representations of an integer as a sum 
of ten squares. In this case the formula for Rig(n) contains two terms. The 
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first is a divisor function, that is, a sum over divisors of n, and the second 
is a sum over representations of n as a sum of two squares. 
Theorem 14.8 Let n be a positive integer, 
n= 2°m, 
where a > 0 and m is odd. Then 


4 a ear) 
Rio(n) = (1641 Se 1/2) Ss" GH¢ 1)/2 44 
m=déb 


16 
+ 5 SS (v4 = 3u7w") . 


n=v2+w2 


As an example, we list the representations of 5 as a sum of ten squares. 
There are a>) = 32 - 252 = 8064 representations as a sum of five terms 
of the form (+1). There are 2?(1”)(?) = 360 representations as a sum 
of the integers (+1)? and (+2)?. Thus, there are 8424 representations. By 
Theorem 14.8, with n = m= 5 and a= 0, we have 


4 16 
Rio(5) = ede tA) ela. S> (a* = 3ay’) 
5=22+y?2 
42 1 

= oe = (a(24 3-27) + 4(1* — 3-2)) 

_ 42568 448 

eS ey 5 

= 8424. 


Proof. By Theorem 14.2, it suffices to find a function ®(n) such that 


®(0) =1 and 
S- (n — 1127) ®(n — x?) =0 
lel<vn 
for every positive integer n. 
We begin by applying identity (14.3) to each of the monomials x°y, x3 y°, 
and zy°. With f(x,y) = x°y, we obtain 


So (-DE-YP (6 — 2u)?(ut d) 


= S- aye 


x S- (j,) (ayer eubt + S- (j,) (ays! 


O<k<5 O<k<5 
k=1 (mod 2) k=0 (mod 2) 


De 


u2+dé=n 
6=1 (mod 2) 


+ 80d6u* 


> 


u2+dé=n 
6=1 (mod 2) 


— 8067u* 


De 


u24dé=n 
5=1 (mod 2) 
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(—1)-/2(d6> — 1054u? + 40d6°u? — 806? u4 


— 32u®) 
(—1)°-9/2(54(n — u?) — 1054u? + 4082u?(n — u?) 


+ 16u4(5n — 5u) — 32u°) 
(—1)@-D/2(64(m — 11u?) + 4067u?(n — 3u?) 


+ 16u4(5n — 7u?)) 


L 
£>—(-1)*4(25 - 1) 


n=l? 


{16n? — 40n? + 25n}, yo 


by Exercise 4. 
Applying (14.3) with f(x,y) = 2°y, we obtain 


Se) 


u2 


+dé=n 


5=1 (mod 2) 


is 


u2+dé=n 


(-Y/2(§ — 2u)3(u+ d)? 


(-—1)-)/? (3d63u? + 12d°6u? — 66?u* — 24d?u4 + d36% 


5=1 (mod 2) 


— 18d?6 


y 


u2+dé=n 


2u? + 36d6u* — 8u®) 
(—1)©-9/? ((367u? + 12d?u?)(n — u?) 


5=1 (mod 2) 


— (367u? + 12d?u”)2u? (dé — 2u?)((d6é — 2u”)? — 12d6u")) 


» 


u2+d6=n 


(—1)°-)/2((367u? + 12d?u?)(n — 3u?) 


5=1 (mod 2) 


+ (n — 3u?)? — 12u?(n — u?)(n — 3u?)) 


£ 


LS ye a1) 


j=1 


An? — 3n? 
{ 


pals 


n=l? 


by Exercise 3 in Section 14.5. 
Applying (14.3) with f(a, y) = ry°, we obtain 


2S 


u2+d6=n 
6=1 (mod 2) 


(—1)°-Y? (6 = Qu) (a +d)? 
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= S> (-1)@-P/7(d°6 — 10d*u? + 10d%Su? — 20d? u* 


u2+dé=n 
6=1 (mod 2) 


+ 5d6u* — 2u®) 
- S- (—1)®-)/? (d4(n — 11u?) + 10d?u?(n — 3u?) 


u2+dé=n 
6=1 (mod 2) 


+ u4(5n — 7u?)) 
l 
= OC @y-1) 


nal 


by Exercise 1 in Section 14.3. 
The upshot of this analysis is the following three identities: 


SS (-1)@-P7(64(n — 11?) + 406?u?(n — 3u?) 


u2+dé=n 
6=1 (mod 2) 


+ 16u4*(5n — 7u7)) = {16n? — 40n? + 25m} _,n, (14.16) 
S> (-1)@-D/? ((36?u? + 12d?u?)(n — 3u) + (n — 3u?)? 


u2+dé=n 
6=1 (mod 2) 


— 12u?(n — u?)(n — 3u?)) = {4n? -— 37} _,., (14.17) 
So (-DE-PP (dk (n = Lu?) + 10d?u? (n — 3u*) 
u24+d6=n 
6=1 (mod 2) 
+ ut(5n — 7u?)) = {n ree. (14.18) 


We shall eliminate the terms 


S- (-1)0-DP Pu (n — 3u7) 
u2+d6=n 


5=1 (mod 2) 


and 
S- (—1) 0-9/2 62 u? (n — 3u7) 
u24+dd=n 
5=1 (mod 2) 
from these equations as follows: Multiply equation (14.18) by 16 and add 
to equation (14.16), then multiply equation (14.17) by 40/3 and subtract. 
We obtain 


ys (AC Ge a 6d? 6) > (—1)6-D/2 


u2+dé=n u2+d6=n 
6=1 (mod 2) 6=1 (mod 2) 
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40 
x (160%(n — u?)(n — 3u?) + 32u4(5n — Tu") (n — 3) 


3 
= {250 — ea \ 
3 n=l? 


Let P(n) denote the first sum in this equation, and let Q(n) denote the 
second sum. Then 


n°? 
Pln) ~ {25n}uae+Q(n)+ {HI 7 


For positive integers n we define the function y(n) by 


g(n)= So (-1)@ YP (16 d* + 64). 


n=dé 
d,6>1 
5=1 (mod 2) 


Let 


Then 


P(n) = So (-D&-PP(n = 11u?) (16d* + 64) 


n=u24dé 
6=1 (mod 2) 


So (n-lw) SO (-)° PP (16a* + 6) 


uz<n 


I 


n=d6é 
5=1 (mod 2) 


= S- (n —11u?)y(n — u?). 


wen 


If n = @ is a square, then 


sS- (n—11u?)y(n—u?) = (n—116?)y(0) + (n — 11(—£)7) y(0) 
u=tl 
= (-20n)? 
= —25n, 
and so 


P(n) — {25n}nse = > (n— 11u?)y(n - uv”). 


u2<n 


Recall the formula for the number of representations of an integer as the 
sum of two squares: 


R(n)=4 SO (-IOV”, 


5|n 
6=1 (mod 2) 
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am) = Y yee 


utidé=n 
6=1 (mod 2) 


x (100u%(n —u?)(n — 3u?) + 32u4(5n — Zu?) — a(n — w?)) 


x4 SO (-neve 


6|(n—u?) 
6=1 (mod 2) 


= y (40. —u?)(n — 3u?) + 8u4(5n — 7u?) — s(n — w)) 
x Re(n — u?). 


If n = @7, then Ro(n — £2) = R2(0) = 1 and 


> (4ou(n — u?)(n — 3u) + 8u4(5n — 7u?) — = (n - sut)) 
u=e 

x Ro(n — u?) 

64n3 


3 o) 


and so 


00+ de 


» (sown — u?)(n — 3u*) + 8u4(5n — Tu?) — a(n 2 3u2)) 


x Ro(n — u?) 


= »s (sown —u?)(n — 3u?) + 8u4(5n — 7u?) — s(n — 3u?)*) 
wen 
x ay 1 
n—u? =v? 4+w? 


= > (40w%(n — u?)(n — 3u?) + 8u4(5n — 7u?) — s(n _ su’) 


n=u2+v2+w? 


= S- (40u?(v? + w?)(v? + w? — 2u?) + 8u4(5u? + 5w? — 2u?) 


n=u? +0? +w? 


1 
= ov +w? — 2u?)3) 
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| SS 120u20? w? 


n=u2+v2+w2 


(=e 10v® oe 
n=u2+v2+w2 3 3 
+ > (60utv? — 80u7u* + 60u*w? — 80u2w* — 10v*w? — 10v7w*) 
n=u2+v2+w2 
= 4 a (u® — 15uty? + 30u7v*w") ; 
n=u2+v2+4+w2 
The simple form of the last equation arises from a symmetry argument: If 
h(u,v, w) is any function and o is any permutation of u,v, and w, then 


YS h(ur,w)= YS (ou), o(v), o(w)). 
n=u2+v2+4+w2 n=u2+v2+w2 
For every nonnegative integer n we define the function 
v(n)= So (vt —3v?w?). 
n=v2+w2 
Then w(0) = 0, w(1) = 2, w(2) = —8,..., and 
S- (n — 11u”)b(n — u?) 


uz<n 


- SS (n — 11u?) S- (v* — 3v?w?) 


ur<n n—u2=v2+w?2 


= 2 (n — 11u?)(v* — 3u7w?) 


n=u2+v2+w? 


= Ss (v2 + w? — 10u?)(v* — 3v?w?) 


= s (v® — 2u4w? — 3u?w* — 10u?u4 + 30u2v2w?) 


= S> (u® = 15utv? + 30u?v?w") by (14.5). 
n=u2+v2+w?2 
Therefore, 
3 
Q(n) {= \ = 437 (n—1w)¥(n- wv). 
3 n=l 2 
ur2<n 
We define ji F 
Bn) = ein) + 4¥(n)) 
5 
Then 
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and 


SE (n — 11u?)6(n — u?) =0 


u2<n 


for all positive integers n. It follows that 


Rio(n) = = (y(n) + 4¢(n)) 


So (De PP6d* + 64)+ = SY) (vt 30? w?). 


dé=n 
6=1 (mod 2) 


Oo Bou wR 


n=v2+w? 


Let n = 2%m, where m is odd and a > 0. Since n = dé with 6 odd if and 
only if d is of the form d = 2d,, where d is a divisor of m, then it follows 


that 
> (ye) 16 cs 16211 ye (CO eae 
dé=n d,é6=m 
6=1 (mod 2) 


Moreover, if m = d,6, then 


(pe = ier ee ape 


and 


> (10g = S- (ph Pi es 


doe d,é=m 
6=1 (mod 2) 
- S- (—1)(@-D/2q4 
dyé=m 
= (Se ae S- (=1)@-D/2¢4, 


d,é6=m 


This completes the proof. 


Exercises 


1. Compute Rio(n) for n= 1,...,10. 

2. Find all representations of 10 as a sum of 10 squares. 
3. Find all representations of 6 as a sum of 10 squares. 
4. Prove that for every positive integer £, 


L 
S > (-1)*-4(29 — 1) = 166° — 4068 + 252. 


jai 
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5. Evaluate the sum 


Dera 


— 
Il 
un 


ive 


6. Evaluate the sum 


L 
De Geta Crees 


i 
BR 


7. A Gaussian integer is a complex number v + wi, where v and w 
are ordinary integers. The norm of the Gaussian integer v + wi is 
N(v + wi) = v? + w. Prove that 


S- (v* — 3v?w?) = ; S- (v + wi)*. 


n=v24+w? N(v+wi)=n 


14.8 Notes 


Liouville’s identity, applied to “appropriate” polynomials and rearranged, 
gives formulae for the number of representations of an integer as the sum of 
an even number of squares. Our manipulations evolved the old-fashioned 
way, by hand with pencil and paper, but almost certainly it is possible 
today to do this more efficiently with human-assisted computer algebra 
systems. It would be a useful exercise to derive formulae for R,(n) for even 
numbers s > 12 using software such as Maple or Mathematica. 

The proofs in this chapter are based on Venkov’s exposition [149] of 
Liouville’s method. Analytic proofs of these results can be found in the 
books of Grosswald [43], Knopp [81], and Rademacher [119]. An interest- 
ing discussion of the problem of sums of squares appears in Hardy’s book 
Ramanujan [52, Chapter IX]. 

Iwaniec [74] considers the more general problem of the number of repre- 
sentations of an integer n by a positive definite quadratic form Q(a1,...,Xs). 
We denote the representation number by rg(n). This is the Fourier coeffi- 
cient of the theta function 


and 
6Q(z) = Eg(z) + Fo(z), (14.19) 


where Eg(z) is an Eisenstein series and Fg(z) is a cusp form. 
In this chapter we considered the positive definite quadratic form 


O(a, cei tg) = ee eee ee?, 
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If s is even and s < 8, then the cusp form in (14.19) is zero and r.(n) is 
the coefficient of an Eisenstein series. If s is even and s > 10, then the cusp 
form in (14.19) is nonzero, and the main term in r,(n) is the coefficient 
of an Eisenstein series and the remainder term is the coefficient of a cusp 
form. In this case, Liouville’s formulae might provide a method to compute 
the coefficients of cusp forms. 


15 
Partition Asymptotics 


15.1 The Size of p(n) 


A partition of n is a representation of n as a sum of positive integers. The 
order of the summands does not matter. We often write the partition in 
the form 

N= 4, + 42 +++: + ag, 


where 
ay > ag>:::>ap> il. 


For example, the partitions of 5 are 


5, 

Asie 

342: 

ees 

rene eo 
pe ee ae 
Pag eT 424, 


The unrestricted partition function p(n) counts the number of partitions of 
the positive integer n. Thus, p(5) = 7. This function is strictly increasing, 
and satisfies the asymptotic formula 


(15.1) 
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2 | 12 
= = 24/ — = 2.565.... 
Co nf? 6 


log p(n) ~ con. (15.2) 


Hardy and Ramanujan [58] and Uspensky [146] independently discovered 
this result; their proofs used complex variables and modular functions. 
Erdés later found an elementary proof of (15.1). The idea of Erdés’s proof 
is simply to apply induction to the recursion formula (Theorem 15.1) 


np(n) = S- up(n — kv). (15.3) 


where 


It follows that 


The proof, however, is difficult; it is “elementary” only in the technical sense 
that it does not require complex analysis. We shall use Erdés’s method to 
obtain (15.2). The determination of the asymptotics of partition functions 
is our third problem in additive number theory. 

Let A be a nonempty set of positive integers, and let d = gced(A). For 
every positive integer n, the partition function p(n) counts the number 
of partitions of n into parts belonging to A. We define p,4(0) = 1 for all 
sets A. We would like to understand the asymptotic behavior of p(n). For 
example, if A is the set of odd positive integers, then pa(n) is the number 
of partitions of n into odd parts, and log pa(n) ~ my/n/3. 

If d = gcd(A) > 1, we consider the set A’ = {a/d : a € A}. Then 
gcd(A’) = 1, and 


0 ifn 40 (mod d), 
Pale) ={ oy tna) ifn =0 (mod d). 


Thus, it suffices to consider only partition functions for sets A such that 
gcd(A) = 1. 

We do this in two significant cases. In the first, A is a finite set of integers 
with |A| = k and gcd(A) = 1. We shall prove that 


rat) = (qa) eat) 


In the second, A is a set of integers of positive density d(A) = a with 
gcd(A) = 1. We shall prove that 


log pa(n) ~ cov/an. (15.4) 


We shall also prove an inverse theorem: If A is a set of positive integers 
whose partition function satisfies (15.4) for some a > 0, then gcd(A) = 1 
and A has density a. 

We begin by proving the recursion formula (15.3). 
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Theorem 15.1 For every positive integer n, 


np(n) = S- up(n — kv). 
ku<n 
kyv>1 
Proof. The parts in a partition of n are positive integers v not exceeding 
n. The number of partitions of n with at least one part equal to v is p(n—v). 
For any positive integer k, the number of partitions of n with at least k 
parts equal to v is p(n — kv), and so the number of partitions of n with 
exactly k parts equal to v is p(n — kv) — p(n — (k + 1)v). Therefore, the 
number of parts equal to v that occur in all partitions of n is 


ee k(p kv) — p(n — (k+1)v)) = S7 p(n — kv). 


k>1 k>1 


We list the p(n) partitions of n as follows: 


Q1,1 7 41,2 T*** 1 Q1,ky; 
m= 421 1742.2 T**' + a2 ke, 
= 43,1 1 43,2 T°** + a3,k3; 

NM = Ap(n),1 + Ap(n),2 + °°" + p(n), kpony’ 


Adding the p(n) rows of this array, we obtain 


p(n) ky 


np(n) = Dd) > paig 


<—e 


= dvr 


Qij=U 


= yom 


v=1 k>1 


= S- up(n — kv). 


ku<n 
k,v>1 


This completes the proof. 


Exercises 


1. Compute p(n) for n = 1, 2,3,4. 
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2. Let g(n) denote the number of partitions of n into distinct parts. Let 
A be the set of odd numbers and pa(n) the number of partitions 
of n into not necessarily distinct odd parts. Compute p(6), q(6), and 
pa(6). 


3. Compute p(7), ¢(7), and pa(7). 
4. Use the recursion formula (15.3) to compute p(8). 
5. Let A= {1} U{2n:n > 1}. Prove that 
pa(2n) = pa(2n+ 1) 
for all nonnegative integers n. 
6. Prove that if pa(n) > 1 and pa(no) > 1, then pa(n) < pa(n+ no). 


7. Let A be a nonempty set of positive integers, and let a; € A. Prove 
that the partition function p(n) is increasing in every congruence 
class modulo aj, that is, 


pa(n) < pa(n + a1) 


for every positive integer n. 


Prove that for every real number x > a, there exists an integer u 
such that 
a-a<uce 


and 
max{pa(n):0<n<a}=pa(u). 


15.2. Partition Functions for Finite Sets 


Theorem 15.2 Let A be a nonempty finite set of relatively prime positive 
integers, with |A| =k. Let pa(n) denote the number of partitions of n into 
parts belonging to A. Then 


rate) = (qa) eat - 


Proof. The proof is by induction on k. If k = 1, then A = {1} and 
pa(n) = 1, since every positive integer has a unique partition into a sum 
of 1’s. 

Let & > 2, and assume that the theorem holds for k — 1. Let A = 
{ai,...,a%}. Then gcd(A) = (a1,...,a,) = 1. If d = (a1,...,@x-1), then 
(d,a,) =1. Fori=1,...,k—1 we set 


a; 
! a 
a,=—. 


od 
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Then gced(aj,...,a,_,) =1, and 
A’ = {a},..-,@,_1} 


is a set of k — 1 relatively prime positive integers. Since the induction 
assumption holds for A’, we have 


= 1 nk-2 LO (nk-3 
pan) =\ Tera) Gat Oe) 


/ 
i=1 % 


for all nonnegative integers n. 
Let n > (d—1)ax. Since (d, ay) = 1, there exists a unique integer u such 
that O0<u<d-—1 and 


n=uwua, (mod da). 
Then 


nr — UaE 


d 


m= 
is a nonnegative integer, and 
0O<m<n. 
If v is any nonnegative integer such that 
n=vax (mod d), 


then va, = ua, (mod d), andsov=wu (mod d), that is, v= u+ éd for 
some nonnegative integer @. If 


n— voz, =n— (ut ld)az > 0, 


then 


n u m 
o<es |e 3} =[2]=rsm 
Let 7 be a partition of n into parts belonging to A. If 7 contains exactly v 
parts equal to ax, then n—vay > 0 and n—vax, =0 (mod d), since n—vay 
is a sum of elements in {a1,...,@,—1} and each of the elements in this set 
is divisible by d. Therefore, v = u+ ¢d, where 0 < @ < r. Consequently, 
we can divide the partitions of n with parts in A into r + 1 classes, where, 
for each = 0,1,...,r, a partition belongs to class ¢ if it contains exactly 
u-+ éd parts equal to ax. The number of partitions of n with exactly u+ éd 
parts equal to a, is exactly the number of partitions of n — (w+ ¢d)axz into 
parts belonging to the set {a1,...,a%—1}, or, equivalently, the number of 
partitions of 

n—(ut éd)ax 

d 


=m— lax 
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into parts belonging to A’, which is exactly p4/(m — €a,). Therefore, 


pa(n) 


So par(m — lax) 
£=0 


1 5 m — bax)*-? es 
= (Gera) (Sea re6) 


dk-1 “. (m — baz)*-2 2 
= (=) >! ah + O(n*-?), 


We evaluate the sum as follows. Since 


y= 


by Exercise 5, and since 


ay 


we have 
“. (m — bay)*-? 
=0 
r k-2 k—2 wa 
a aA LLl; )m *~2-F(—Lay)4 
£=0 j=0 

os LR B-2) peg i pi 
7 cmd j )m ey) af 

ee fan) ae . 
z aL j )m oe ee 0) 
ee ee ee eee ae 
~ amy j )m (9%) (a 
_ m1 k-2) — (-1) k-2 
a a > ( ; )w=aigey tor 

hoi b-2 
man, wth (as j m*-2 
~ “a, 24 =2-iig ry +O) 

k—1 b-2 
oe, HL (=I? : 2 
ee caries rs (Cos i ae 
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mkt ad : 
: sate ap 2 G1) + Oem ) 
mk-1 
ax (k — 1)! + O(m*-?) 
Therefore, 
dk-1 r (m — fax)* 2 ee 
(n) ( =  O(nk-2) 
DA eae 2)! 
k-1 me} 
- GS -| (<a 71)! + om!) + O(n?) 
l n— uap)*-* ; 
- (a 7 Roi + O(n*?) 
1 mel = 
ly -| @-p Oo” ) 


This completes the proof. 


Corollary 15.1 Let py(n) denote the number of partitions of n into at 
most k parts. Then 


nk-1 


Sar er a 


px(n) 

Proof. We know that p;,(n) is also equal to the number of partitions of n 
into parts no greater than k. The result follows from Theorem 15.2 applied 
to the set A = {1,2,...,k}. 


Corollary 15.2 Let A be an infinite set of positive integers with gcd(A) = 
1. Then i 


n—0o log n 


Proof. For every sufficiently large integer k there exists a subset F;, of 
A of cardinality k such that gcd(F;,) = 1. By Theorem 15.2, 


nol 
pa(n) > pr,(n) = f—O\]Lce a O (n*-?), 


and so there exists a positive constant c, such that 


pa(n) > cyn®-? 
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for all sufficiently large integers n. Then 

log pa(n) 2 log pr,(n) 2 (k — 1) logn + log cr. 
Dividing by logn, we obtain 
1 


lim inf > 
n—00 og n 


This is true for all sufficiently large k, and so 


1 
lim OS PA (n) = 


noo logn 


This completes the proof. 


We can also use generating functions to compute partition functions of 
finite sets. For example, let A = {1,2,4}. By Theorem 15.2, we have 


ie 
n) ~ —+ O(n). 
pa(n) ~ 7 + O(n) 
Using the partial fraction decomposition of the generating function, we can 
obtain an exact formula for pa(n) that is stronger than this asymptotic 
estimate. We have 


— n _— 1 
Lpalne” = Gas aaa 
1 
~ (i 2)8(1+2)2( + 2?) 
ia 9 1 1 
~ 32(1—2) ' 4(1—2)? © 8(1—2)3 
5 1 l+a 


3(14+a) 1 l+a + +a) 


We write each partial fraction as a power series: 


9 = an 
3201-2) 33" 

1 — wa(nt+l) , 
4(1— 2)? Ss A 

1 ea (nt+2)(n+1), 
81-2 dX 16 vr 
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1 ia 3 (=D)"(r+ I) on 
16(i-pg)? ee 16 
l+@z - - (-1)"(1+2) 2, 
8it+22) dX 8 
= =)" 2n = asl Wigs 2n4+1 
Sg 
n=0 n=0 
_ ae 
n=0 8 
Therefore, 
Oe 9 ntl (n+2)(n+1) | (-1)"5 
ae nO cae 16 Cag 
Oe eA car a 6 RO Co Dac 
16 8 
nr? +(7+(-1)")n ds 21+ (-1)"7+ (-1)lr/4la 
7 16 32 
If n is even, then 
is n?+8n+16  (-1)P/7l—1 
eo 16 8 
ee ees ifn=0 (mod 4), 
int 21 iPS? “(mod 4): 
If n is odd, then 
(n) n+én+9 (-1l/l—1 
n = 
ee 16 8 
(aed ifn=1 (mod 4) 
6 cae Fr 


Exercises 


ey —; ifn=3 (mod 4). 
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1. Let p2(n) denote the number of partitions of n into at most 2 parts. 


Prove that 


n 


po(n) = [| apt 


2. Let a > 2 and A = {1,a}. Prove that 
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3. Let A = {2,3}. Prove that 


oo 2) 41 if n is even and n > 2, 
Pal(n) = n=3) 41 if nis odd and n> 3. 


4. Let A = {2,a}, where a is an odd integer, a > 3. Compute p(n). 


5. Prove that 
pitl 


ee Gad + O(r). 


6. Let A= {1,2,3}. Let p = (—1+7iV3)/2. Confirm the partial fraction 
decomposition 


Hie os 1 
Dd palnye _ (i a)(1 x2)(1 x3) 


1 
(1 2)8(1 + 2)(1 — px) (1 — p?2) 
= 1 1 17 
~ 61—2)3 © 4(1—2)? © 72(1-2) 
1 1 1 
' 8(11+2)  9(1—pax) | 9(1— p2x) 


Show that this implies that 


pa(n) = mens) us = CN" + Z(e" + p>") 
-< ae sey 
where ; 
Ir(n)| < 5" 


Conclude that p(n) is equal to the integer closest to (n + 3)?/12. 


7. Let pz(n) denote the number of partitions of n into at most k parts. 
Show that the average number of parts in a partition of n is 


Pl) = —— Yk (x(n) — Palm). 


k=1 


Remark. Erdés and Lehner [35] proved that p(n) ~ cp + V/nlogn. 
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15.3 Upper and Lower Bounds for log p(n) 


In this section we give Erdés’s elementary proof that log p(n) ~ co./n. We 
begin with some estimates for exponential functions. 
Define p(0) = 1 and p(—n) = 0 for all n > 1. 


Lemma 15.1 If0< <n, then 


ue ma pa <n b< Yn ee 
Proof. If 0 < x <1, then 
fate E26 a)? 12, 
2 2 2 


The result follows by letting 7 = ¢/n. 


Lemma 15.2 If x > 0, then 


If0<a <1, then 
2: 


> 
i= e-*) 72 


Proof. The power series expansion for e” gives 


ee 1 yy 2k4+1 
2/2 _ 4-2/2 _ 9 (=) 
ii Pi ren 2 


: g2k—2 
192k 
2, (Qk +112 


If « > 0, then 


and so 


If0<a <1, then 


/2 _ o-«/2 3 3 
e” "Te <uet+en Se u+az ee 
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and so 


e-* 1 1 a. a 
SNe = _ 5) > av = 3 2. 
(1 -—e7*) (e*/? Eox@ Bie) x x 


Lemma 15.3 Letc be a positive real number and let n be a positive integer. 


Then 
es e 2vn Qn? n 
» — ck_ < 3¢2 i 


Ifn > c?/4, then 


ck 
2/n 


and so 


If fn >c/2 andl <k < 2\/n/c, then 0 < x < 1 and, by Lemma 15.2, 


ck 


e 2vn Be 1 — An 
(1 - e 2vm)2 a? Bees 
Therefore, 
oa) _ _ck _ ck 
Ee 2vn Ee 2vn 
» Sar eg y re. Se 
k=1 (l-—€ 3v") ioe Ae) 
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4n [1 1\ 4 
t(Se- D pl]-~ 


k=1 k>2Vn/e 


IV 
| 


_ 2n?n An 3 1 4/n 
7 k? Cc 
k=[2Vn/c]+1 


For k > 1 we have 
1 1 k+1/2 ay 
mS rare ee #2 


4n S es dt _ 4n 1 
2 k2 Cc [2./n/c]+1/2 t? Cc? [2./n/c] + 1/2 


k=[2\/n/c]41 
Z An 1 Z An 
2 2/n/e—-1/2~— ¢ 
In the last inequality we used the fact that /n > c/2. Therefore, 


Sek 
e 2vn 2r?n = 8/n 


[oe) 
Ds ar ras 3c? c 


Lemma 15.4 Let0<t< 1. Then 
= t 


So vt” = Gi —@: 


v=l1 


and 
+4? +t 6t 


Proof. Differentiating the power series 


we obtain 
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a-HF ~ 0 ae 


v(u— 1)(v — 2)? 


aN 
llo 

= 

ll 
8 


e 
II 
w 


(v? — 3u(v — 1) — v)t? 8, 


I 
lege 


v=3 
and so 
s ace cmmrsr' ata oe Ge" me 1 
Then 
Soo ae 32 o(v— 12 +t 0 
a 4 
v=1 (1 t) v=2 v=1 
~ 6 6t? feat 
ie) ee) ane eae ee Cn 
_ 844i? +¢ 
(1-28) 
2 6t 
~ \ (Ei)* 


Theorem 15.3 


log p(n) ~ covin. 


Proof. We shall use induction to obtain upper and lower bounds on p(n). 
First we prove that 
p(n) < ev” (15.5) 


for all nonnegative integers n. This is clearly true for n = 0 and n = 1. Let 
n > 2, and assume that the inequality holds for all integers strictly smaller 
than n. The notation >°,,,-,, means the sum over all positive integers k 
and v such that kv < n. We have 


S- up(n — kv) < S- werer rhe 


kv<n ku<n 


S> ve 


ku<n 


I 


np(n) 


IA 


(by Lemma 15.1) 
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ev a (eH) 
k=1v=1 
oe __ cok 
avn 
= evr S*__ © "(by Lemma 15.4) 
k=1 (: —e vn) 
2 2 
< (=) necov™ (by Lemma 15.3) 
305 
= nev”, 


This gives the upper bound (15.5). 
Next we shall prove that for every ¢ with 


0O<E<c 
there exists a constant A = A(e) > 0 such that 
p(n) > Aelo-e)v™ (15.6) 


for all positive integers n. We begin by letting A = e~®. Then (15.6) holds 
for n = 1, since p(1) = 1 > e7& = Ae™*. 
Let n > 2, and assume that (15.6) holds for all integers less than n. Then 


np(n) = > vp(n— ke) 


ku<n 
> A S- yeloe)¥n—ke 
ku<n 
> A S vel (Vea aa) (by Lemma 15.1) 
ku<n 
= Aelo- WHS yo (“Vaya ant), 


ku<n 


We shall show that 


kv k2y2 
y ve (co e) (te +Han) >n. 
ku<n 
Since e~” > 1-2, we have 


ky? 


oe ((co-e) =) >1- (co = €)k?v? 


9n3/2 ? 


and so 


kv k2 v2 
y ve ©) (stat Sa) 


ku<n 
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_ (co-e)kv CM —e_e _ (core)kv 
> y ve 2vn co!) ) heute Vn 
-_ 2n3/2 

ku<n ku<n 


II 
WH 
H 
— 
3 
we 
| 
— 
a 
j=) 
io) 
Ne 
tO 
— 
3 
Ww 


We shall estimate the sums $\(n) and S9(n). 
If kv > n, then 


ae s (co es zs oes) 


> 0. 


Since 
ett? for t > (co — €)/2, 
we have 
_ (eg-e)ku = k —6 
vA «(Se ) 
ku>n ku>n vn 
1 
3 
<0 Dae 
ku>n 
1 
3 
aS > (kv) 7/25/2437? 
ku>n 
Leet, ON aoe 
ey) Pp 
5/2 3/2 
vn k=1 es gai ‘ 
1 
<a 
n 
Then 
(cg —e) kv 
Si(n) = So ve a 
ku<n 
28. 28 (cg —e)kv (cg —e) ku 
> See ae 
k=1v=1 ku>n 


~ e€ 2a 1 
= yz tO (=) (by Lemma 15.4) 
Vn 


> a—azq t O(Vn) (by Lemma 15.3) 
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2 2 2 
PR. 5 PG Eta. 8 
3(co — €)? Co —€ Co —€ 

€ 


since 


V 
4+ 
ae 
| 


= s 
We estimate the sum $5(n) as follows: 
_ fe ae 
Ss eee oe 
ku<n 
2 _ (o-e)ku 
< yh ye oo aga 
n — “ore Ok 
ke 
7 , ie (by Lemma 15.4) 
a (1 —e ~* ) 
(co—e)k 
. e 2/n k2 
= 6 
y ( saa) ( Sg) 
i as l—-e 2vm 
“ 4n k? 
ws by L 15.2 
k=l (Sas) ( ey) (by Lemma ) 
7 l—e  2vn 
< a 1 
k=1 ( See) 
ia l-e 2Jn 
Let 


Ifl<k< Vn, then 0 < x < co/2 and 


xz 
l-e*= | edt > ze~* > xe~/?, 
0 


2 is 
(1 = cS" ) =(1- ee) Spe ea SMT ate aay 


Therefore, 
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If k > Jn, then 


S- : <x ee 


(co~e)k = Stes) 
Va<k<n (1 —@ 2vm Vn<k<n {1—e7 


Therefore, 
So(n) K n?. 
Since 
Si(n)>0 and Sg(n)>0, 
we have 
(co — €) 2e = (co — €) 2 
Si(n) — 33/2 a(n) > (1+ a n+O(/n) — 33/8 O(n) 
2 
> (1+ =) n—cC/n 
Co 


for some positive constant c;. Then 


IV 


Ace) (syn )- une )) 
Ane" 4 Ayeleo-2v (ZEW _ ) 


Co 


np(n) 


IV 


Ss Anelco-vn 


if we choose A > 0 small enough that (15.6) holds for all n < (c9c,/2e)?. 
It follows from (15.5) and (15.6) that for every « > 0 there exists a 
constant A such that 


(co — €)/n t+ log A < log p(n) < con 


for all positive integers n, and so log p(n) ~ con. This completes the proof 
of the theorem. 


Exercises 


1. Prove that the recursion formula (15.3) is equivalent to 


np(n) = 5 a(v)p(n— Vv). 


v=1 
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15.4 Notes 


In 1918 Hardy and Ramanujan [59, 58] published the asymptotic formula 
for the partition function. Uspensky [146] obtained the same result indepen- 
dently in 1920. Both papers used complex variables and modular functions 
to deduce the asymptotic estimate p(n) ~ (4n\/3)~tev”. In their 1918 
paper, Hardy and Ramanujan wrote, 


it is equally possible to prove [log p(n) ~ co./n] by reasoning of 
a more elementary, though more special character; we have a 
proof, for example, based on the identity 


np(n) = J o(v)p(n—v), (15.7) 


where o(v) is the sum of the divisors of v, and a process of 
induction. 


Many years later, however, Hardy wrote in his book Ramanujan [52, p. 
114], 


It is actually true that log p(n) ~ m\/(2n/3) ..., but we cannot 
prove this very simply. 


Hardy and Ramanujan clearly had no elementary proof of the asymptotic 
formula (15.1); in their 1918 paper they wrote that 


we are at present unable to obtain, by any method which does 
not depend upon Cauchy’s theorem, a result as precise as [p(n) ~ 
ev" /(4y/3)n], aresult, that is to say, which is “vraiment asymp- 
totique.” 


Erdés’s proof of the asymptotic formula for p(n), published in 1942 in [32], 
is a tour de force of elementary methods in number theory. This proof is 
not as famous nor as controversial as the elementary proof of the prime 
number theorem, but it is impressive in its depth and technical difficulty. 
It shows that the asymptotic formula for p(n) is simply a consequence of 
the elementary recursion formula (15.7), and is independent of any deep 
analytic properties of modular functions. 

Knessl and Keller [80] develop Erdés’s method and apply the recursion 
formula for the partition function to derive formal asymptotic expansions. 

Grosswald [42] and Hua [68] have presented Erdés’s elementary proof 
of (15.2). There is a different elementary proof of the upper bound log p(n) < 
m/2n/3 in unpublished lectures of Siegel on analytic number theory; Siegel’s 
proof appears in Knopp [81, pp. 88-90]. Analytic proofs of (15.1) can be 
found in Apostol [4], Knopp [81], and Rademacher [119]. 

The standard proof of Theorem 15.2 uses the partial fraction decomposi- 
tion of a generating function. The proof in this book is due to Nathanson [107]. 
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Let Py(n) = p(n) — pe—i(n) denote the number of partitions of n into 
exactly k parts. Erdés [33] proved that for fixed n, the maximum value of 
P,(n) occurs for kg ~ cg 'n'/? logn. This had been conjectured by Auluck, 
Chowla, and Gupta [6]. Using hard analysis, Szekeres [137, 138] proved that 
for sufficiently large n, the finite sequence P;(n) is unimodal in the sense 
that there exists an integer ko such that P,_-1(n) < Py(n) for 1 < k < ko 
and Pr_-i(n) > Px(n) for ko + 1 < k < n. It would be very interesting 
to have an elementary proof of the unimodality of the partition function 
P,, (n). 

Rademacher [117, 118] obtained a convergent series for p(n) of the form 


_ 1 ‘ kl/?-Ay(n) 4 pias Ge V3) 


Xn 


After studying the original paper of Hardy and Ramanujan, Selberg (un- 
published) independently proved the same formula. Many years later he 
wrote [130], “I am inclined to believe that Rademacher and I were the only 
ones to have studied this paper thoroughly since the time it was written.” 


16 


An Inverse Theorem for Partitions 


16.1 Density Determines Asymptotics 


Let A be a set of integers, and let A(x) denote the number of positive 
elements of A that do not exceed x. Recall that A(x) is called the counting 
function of A. Then 0 < A(x) < a, and so 0 < A(a)/ax < 1 for all x. The 
set A has asymptotic density a if 


For example, the set of all positive integers has density 1, and every finite 
set has density 0. The set of even integers has density 1/2. By Chebyshev’s 
theorem (Theorem 8.2), the set of prime numbers has density 0. 

If A has density a, then for every ¢ > 0 there exists a number x(¢) such 
that for all x > xo(e), 


A 
Ae! =e 
x 
or, equivalently, 
(a—e)x < A(x) < (a+ e)z. (16.1) 


There exists an integer ko(e) such that if a, € A and k > ko(e), then 
ap > Xo(e). Setting x = ay in inequality (16.1), we obtain 

(a—e)ap <k < (a+ €)ag, 
and so 


k 
— <ap< . 
Aa+e a-e 
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In Chapter 15 we proved that log p(n) ~ co,/n. In this section we shall 
prove that if A is any set of integers of density a > 0 and gcd(A) = 1, then 


log pa(n) ~ coVan. (16.2) 


In Section 16.2 we prove the converse: If A is any set of positive integers 
whose partition function p4(n) satisfies (16.2) for some a > 0, then A has 
asymptotic density a. 

A set of positive integers is cofinite if it contains all but finitely many 
positive integers. We begin with a simple result about partition functions 
of cofinite sets. 


Lemma 16.1 Let A be a cofinite set of positive integers. Then 


log pa(n) ~ covn. 


Proof. If A is cofinite, then A contains all sufficiently large integers. 
Choose a positive integer £ > 1 such that A contains all integers greater 
than @, that is, 

B={n>l+1} CA. 
Then 
pp(n) < pa(n) < p(n). 
Since log p(n) ~ cov/n, it suffices to prove that log pp(n) ~ cov/n. 

Consider the finite set F = {1,2,...,¢}. Since gcd(F’) = 1, Theorem 15.2 
implies that there exists a constant c > 1 such that pp(n) < en*? for all 
positive integers n. Each part of an unrestricted partition of n belongs to F 
or to B, and so every partition of n is uniquely of the form n = (n—m)+m, 
where n — m is a sum of elements of F and m is a sum of elements of B. 
By Exercise 4, the partition function pa(n) is increasing for n > 1, and so 


p(n) = >) pr(n—m)pp(m) 
m=0 


IN IA 


Taking logarithms and dividing by co./n, we have 


log p(n) é log2c+flogn | logpa(n) 
oVv/n — con Eg/n 
é log 2c + (€— 1) logn i log p(n) 
= covn con | 


Letting n go to infinity, we obtain log pp(n) ~ co\/n. This completes the 
proof. 
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Theorem 16.1 Let A be a set of positive integers. If A has density a > 0 
and gcd(A) = 1, then the partition function pa(n) satisfies the asymptotic 
equation 

log pa(n) ~ covVan. 


Proof. Let A = {ax}%2,, where aj < ag < ---. Let 0 < € < a. Since 
d(A) = a and gcd(A) = 1, there exists an integer >) = fo(€) such that 
gcd{a, :1<k < lo} =1 and 


(16.3) 


ef 
QAa+e a-e 


for all k > &. 
We begin by deriving the upper bound 


log pa(n) 
] oe. 
pre coVfan — 


Let F = {a@1,@9,...,a¢,} and B = {a, € A: k > & 41}. Let mbea 
positive integer, m <n, and let 


Mm = ap, + Ag +++: + Ge, 


be a partition of m with parts in B. To this partition of m we associate 
the partition 
W=kytkot--- +k. 


By (16.3) we have ki < (a+ €)ax,, and so 


/ 


Wo < (atejag, + (a+ €)an, +++: + (a+ €)ax, 
= (a+te)m 
< (a+e)n. 


This establishes a one-to-one mapping from partitions of m with parts in 
B to partitions of integers n’ less than (a + €)n. Since the unrestricted 
partition function p(n) is increasing, we have 


pa(m) < So p(n’) 


1l<n'<(ate)n 
(a + €)np([(a + €)n]) 
<  2np([(a+e)n]). 


IA 


Recall that A = FU B, where F consists of €9 relatively prime positive 
integers. By Theorem 15.2, there exists a constant c such that 
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for every positive integer n. Every partition of n with parts in A decomposes 
uniquely into a partition of m with parts in B and a partition of n — m 
with parts in F for some nonnegative integer m < n. Then 


pa(rn) = © pp(n—m)pz(m) 


m=0 


< eno ‘2 pa(m) 
m=0 

< on) > anp(((a + )n)) 
m=0 

< den? p(((a + e)n)). 


Since log p(n) ~ co./n, it follows that for every ¢ > 0 there exists an integer 
no(e) such that 


log p([(a + €)n]) < (1+ e)covV [(@ + €)n] 


for n > no(e). Therefore, 


logpa(n) < log 4e+ (lo + 1) logn + log p([(a + €)n]) 
< log4c+ (€9 + 1) logn+ (1+ e)eoV/(ate)n 


for n > l9(e). Dividing by co./an, we obtain 


logpa(n) — log4c+ kp logn / E 
< 1 14+- 
cofan — cofvan TUTE) itt a’ 
E 


and so 


This inequality is true for all ¢ > 0, and so 


l 
lim sup SoA) ) 


<i. 
noo 4CcoVan 


Next we obtain the lower bound 


| 
tim inf OS PAM™ 5s 1. 


n>c00 86cgV/an 


Since gcd(A) = 1, Theorem 1.16 implies that p4(n) > 1 for all sufficiently 
large n. For 0 < € < a, there exists a positive integer 0) = fo(€) such that 
gcd{a, :1<k < lo} =1 and 


< ak < 
QA+eé a-e 
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for all k > @. 
Let p’(n) denote the number of partitions of n into parts greater than 
fo. To every partition 


n=ky+---+k, with ki >--->k, > &, 
we associate the partition 
M = Gp, +++ + Gz,.- 


Inequality (16.3) implies that 


m< 


a-Ee 


This is a one-to-one mapping from partitions of n with parts greater than 
fo to partitions of integers m < n/(a@—€) with parts in A. Therefore, 


p(n) < >> palm) 


m<ote 
n 
< max {pa(mn) :m< \ 
a-€ a-€ 
2 mpa(un) 
a-€ 


where, by Exercise 7 of Section 15.1, uy is an integer in the bounded interval 


— ay <Un < : 
a—eE a—e 


The sequence {u,,}°2, is not necessarily increasing, but 


lim un = o. 
n— oo 


Let d be the unique positive integer such that 
0<(a-e)a,<d<(a-e)a, +l. 


For every 2,7 > 1, 


(i+ j)d id jd 
UGitj)d — Wid > ( ay hee ee a, > (y-l1)a. 


It follows that uG@41)q > wia, and so the sequence {ug}, is strictly in- 
creasing. Similarly, 


) id jd Z J 
i+j)d— Wid < = —— +a, <(jt+lait 
UG+ja~ Wid ( a) ee ee 
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Choose No such that pa(n) > No for all n > No. Let ig be the unique 
integer such that 


al +1<ig< mo + 2. 
ay ay 
Then 
Uid — Ui—ig)d > (to — 1)ar = No 
for all 2 > ip. For every integer n > iod there exists an integer 7 > ig such 
that 
Wid SN < Uta: 


Then ag 
; 10 
NM — UGHip)d S UG+1)d — UGi-in)d < (to + 2)d an ie 
and 
N— U(i-in)d 2 Wid — UE—ip)a > No. 
Therefore, 


pa(n — Ug—io)a) = 1. 
By Exercise 6 of Section 15.1, 


(a — e)p'((i — to) d) 
(i —io)d , 


pa(n) = pa(uci—ioya) > 


Since G4)d 
a+ 
; < 
N<UG4)ad S Bese 


it follows that 
(i —io)d > (a—e)n — (in + 1)d 


and 


(a — €)p'((a — €)n — (to + 1)d) 

(i —io)d 
Since p'(n) is the partition function of a cofinite subset of the positive 
integers, Lemma 16.1 implies that for n sufficiently large, 


pa(n) > 


logpa(n) > logp’((a—e)n-— (io + 1)d)) + log(a — €) — log(i — ig)d 
> (1—e)eoV(a—e)n — (to + 1)d 4+ log(a — €) — log(i — io)d. 
Dividing by co.\/an, we obtain 


_. , logpa(n) 
lim inf >(1 i , 
R00 Cyx/Om 2 (1-€) ore 


This inequality holds for 0 < ¢ < a, and so 


lim inf log pa(n) pa(n) 
n—>c0 §=6cgvV/an 


ke 


This completes the proof. 
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Exercises 


1. Prove that the set {2* : k > 0} has density 0. Prove that the set 
{2*3° : k,£> 0} has density 0. 


2. Let A be a set of positive integers, and let B = N \ A be the set of 
positive integers not in A. Prove that if d(A) = a, then d(B) = 1-a. 


3. In this exercise we construct a set A that does not have a density. 
We denote by (2, y] the set of integers n such that x <n < y. Let 
N, < No < N3 < --- be a strictly increasing sequence of positive 
integers such that lim,—.. N;+1/N; = 00, and let 


A= U (Nor—1, Nor]. 


r=1 
Prove that 
a A(N2,) 
li =1 
TCO Nor 
and a 
tm AN2r+1) _ 9, 


roco | Nop qa 


Since lim sup,,_,,, A(z)/a = 1 and liminf, .., A(x)/x = 0, the set A 
does not have an asymptotic density. 


Hint: Show that A(N2,) = No, a Nor—1 and A(Nor41) < Nor. 


4. We say that a partition aj +a2+---+a, has a unique largest part if 
a, > a2 >--:>a,. Let no be a positive integer, and let A be the set 
of all integers greater than or equal to no. Show that pa(n) = 1 for 
No <n < 2no. Let n > no. To every partition 7 of n we can associate 
a partition of n+ 1 by adding 1 to the largest part of 7. Show that 
this map is a bijection between partitions of n and partitions of n+ 1 
with a unique largest part. Deduce that p,(n) is increasing for n > 1, 
and strictly increasing for sufficiently large n. 


5. Let a,,...,@¢, and m be integers such that 
Ll<a,<-:-<ae<m 


and 
(a1,.--,@e,m) = 1. 


Let A be the set of all positive integers a such that a= a; (mod m) 
for some i= 1,...,@. Prove that 


Ln 
log pa(n) ~ co4/ —. 
m 
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6. Prove that if the set A of positive integers has positive density, then 


d(A) = lim (‘e524 


nee Tog pln) 


7. Let A be a set of positive integers. The upper asymptotic density of 


A is éj 
dy (A) = limsup Ae) 
n 


N— Oo 


Prove that if ged(A) = 1 and dy (A) < a, then 


8. Let A be a set of positive integers. The lower asymptotic density of 


Ais A 
iD ai 


n—-00 n 


Prove that if ged(A) = 1 and d,(A) > a, then 


9. Let A be a set of positive integers with gcd(A) = 1. Prove that if 
d( A) = 0, then log p4(n) = o(,/n). 


16.2 Asymptotics Determine Density 


The goal of this section is an inverse theorem for partitions. We shall prove 
that the asymptotics of the partition function p(n) determines the density 
of the set A. 

We begin with some remarks about generating functions. If a is a positive 
integer and |a| < 1, then the geometric progression 


(lag) tS 1+ 2% + 97% 4+ 9° 4+... 


converges absolutely. If A is a finite set of positive integers, then 


[]a-2)" - I @ tat + x74 + g84 +...) 


acA acA 

ioe) 
So pa(n)a”, 
n=0 


where pa(n) is the partition function for A. 


I 
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If A is an infinite set of positive integers and |a| < 1, then the infinite 


product 
][@-2)" 


acA 


converges absolutely, since 


Shalt s Dolat= <0 


acA 


and 


f(v) = [[a-2%)- = Spal 


acA 
This function is called the generating function for the partition function 
pa(n). 


Theorem 16.2 Let A be a set of positive integers with gcd(A) = 1. Let 
pa(n) denote the number of partitions of n with parts in A. If there exists 
a number a > 0 such that 


log pa(n) ~ coVan, 
then the set A has density a. 


Proof. The proof uses an Abelian theorem (Theorem 16.3) and a Taube- 
rian theorem (Theorem 16.4) that we prove in the next section. The gen- 
erating function 


= S> pa(n)a” = [[c —¢@*)"! 


acA 


converges for |x| < 1. Since 


log pa(n) ~ cofan = 2 


Theorem 16.3 immediately implies that 


Applying the Taylor series 


—log(1 — 2) => 
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for |x| < 1, we have 


Co ak Co 
a v n 
log f(z) = — © log(1 — 2) = eae Pe 
acA a€Ak=1 n=1 
where 
Se yr 
acA acA 
n=ak aln 
By Theorem 16.4, 
2 
nO a ee 
nN<ux 


We define the remainder function r(x) by 


Tax 


Sp(x) = r (1+r(a)). 


The function Sg(x) is an increasing, nonnegative function such that 
Sp(a) =0 for x < 1 and 


I 


1 
DOPE, 
ee aa 


1 
= Soe 
k<a wees 


EMG 


k<ax 


Sp(z) 


where A(z) is the counting function of the set A. By Mobius inversion 
(Exercise 7 in Section 6.3), we have 


Me)= 0 Ese (3). 


For every € > 0 there exists a number 2p = 2o(¢€) such that the remainder 
function r(x) satisfies the inequality |r(a)| < ¢ for all x > x. If k < «/a0, 
then 2/k > xp and |r(x/k)| < ¢. If k > x/axo, then x/k < xp and 0 < 
S'p(a/k) < Sp(ao). Therefore, 


A(z) = HR) (=) 


= ee OG) eG 


a/xo<k<ax 
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n0r u(k) | wax wk) a 
~ 6 ete (Zz) 
ae k<a/2o 
Hoe eG) 

L/ao<k<a 


We estimate these three terms separately. By Theorem 6.17, 


y Ww aS- yy -5+0(2), 


k<a/xo k>a/xo 


and so 


n70r p(k) 
ke 


= a2 + O(Zo) . 


k<a/xo 


Similarly, 


2 k 2 
Fe D Se Deo 


k<a/xo k<a/xo 


The third term is bounded independently of x, since 


k x 1 
ee e(@)| < Seo S| 
a/xo<k<a a/ao<k<ax 

< 2SzB (x0) log xo 


Therefore, 
A(x) = ax + O(ex) + O(a) ~ ax. 


This completes the proof. 


Exercises 


We can use the Taylor series for the generating function for the unre- 
stricted partition function p(n) to obtain a simple proof of the upper bound 


log p(n) < con. 
1. For 0 < x < 1, let 


f(z) = ][Q-2") t= 0 p(m)z” 
n=1 n=0 
Prove that 
oo k 
log p(n) + nlog x < log f(x) = Se TSE 


> 
Il 
un 
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2. Prove that if 0 <a <1, then 
1-2" > ka*-1(1— 2) 
and 
log f(x) < 6-2) 


3. Prove that if 0 < a <1, then 


1 
—loga < ; 
x 


and so 


4. Prove that log p(n) < con. 
Hint: Choose x € (0,1) such that 


16.3. Abelian and Tauberian Theorems 


In this section we derive the two results about power series with nonnegative 
coefficients that were used to deduce Theorem 16.2. The proofs require only 
advanced calculus. To the sequence B = {bn }°2 9 of real numbers we can 
associate the power series f(x) = )7*° 9 bnx”. We shall assume that the 
power series converges for || < 1. We think of the function f(x) as a kind 
of average over the sequence B. In rough language, an Abelian theorem 
asserts that if the sequence B has some property, then the function f(a) 
has some related property. Conversely, a Tauberian theorem asserts that if 
the function f(a) has some property, then the sequence B has a related 
property. 
The following result is an Abelian theorem. 


Theorem 16.3 Let B = {bn}, be a sequence of nonnegative numbers 
such that the power series f(x) = ~~ 9 bnx” converges for |x| < 1. If 


log b, ~ 2Van as n— oo, (16.4) 
then 


log f(x) ~ — asv—1-. (16.5) 
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Proof. Let 0 < ¢ < 1. The asymptotic formula (16.4) implies that there 
exists a positive integer No = No(e) such that 


ERY AR et neh FEO forall iy Np 


The series f(x) converges for || < 1 (by the root test), but diverges for 
x= 1. For0 <2 <1 we let x =e, where t = t(x) = —logz > 0, and t 
decreases to 0 as x increases to 1. 

First, we derive the lower bound 


lim inf(1 — «) log f(x) >a. 


For n > No, 
b,x” > e2(l-e) Van .—in = e2(l-e)Von—in 


Completing the square in the exponent, we obtain 


2(1 —e) an yo Cee (vn ey 


and so : ebigecs 
neite 1-<)/a 
b,x” > oe - et (vE- é ) : 
Choose to > 0 such that 
(“ - =, 


Ny At: 
to 


and let x9 = e~* € (0,1). Let a9 <a <1. Ifx=e', then 0 <t < to. Let 


Then 2 2 
1 iS 
we (CRY sone 
and 
. = 
RPE oy (E PM) eves 
It follows that 2 
Pe 
( af ve) ar 
and so dante (1-2) Va \2 (1-e)? a? 
bn, v"* > SS ee) Soe 
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Since b,x” > 0 for all n > 0, we have 


oe Ae 2a 
f(z) = S- byt” > by, x"? > goer 


n=0 
Therefore, 
1-— 2 
, (b= 2)Pa 
t 


log f(x) at 


and 
tlog f(x) > (1—«)°a—t?. 


By Exercise 1, 
t=-logr~l—-a asx—1-, 


and so 


I 


lim inf(1 — 2) log f(a) lim inf t log f(z) 
x—1— Ed a 


IV 


pT ((1 —€)?a — 2?) 


(1—«)7a. 


I 


This is true for 0 < e < 1, and so 


lim inf(1 — x) log f(x) >a. 


Next we derive the upper bound 


lim sup(1 — x) log f(a) < a. 


x17 
We have 
Co 
f(z) = See 
n=0 
No-1 Co 
ae S- b,x” + S- e2(ite)Van-tn 
n=0 n=No 
Gee we -+( ya-Gtpve)? 
< ci(e) +e t oS € : ’ 
n=No 
where 
No-1 No-1 
0< 35 b,2” < SS by = c1(€). 
n=0 n=0 
Let 


n= mio [1]. 
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Then 
4a < t(Ny + 1) 


If n > Ni, then 


and 


It follows that 


and so, as t > OF, 


Co 


on ee 
n=N,+1 n=N,+1 
e-t(Ni41)/4 
~ “Joe 
e4a/t 
1—e-t/4 
8e—4a/t 
te 
t 
= o(1), 
since 1—t/4 < e~*/4 < 1-1/8 for 0 <t <1. Also, 
MN 5 
wy me teva 1 
yee ry) 2h 
n=No 
Consequently, 
ea (16 
fe) < ale+e (EP + oft) 
<i. 24q 
c(e)e te 
< 
S P 
Therefore, 
1 2 
log f(a) < PE 4 Jog 2E) 
t ? 
and 
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Then 


lim sup(1 — z) log f(x) = limsup tlog f(x) < (1+¢)?a. 


x1 t—0+ 


This is true for every ¢ > 0, and so 


lim sup(1 — x) log f(x) < a. 
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This completes the proof. 
Next we prove a Tauberian theorem about power series with real, non- 
negative coefficients. 


Theorem 16.4 Let B = {b,}°29 be a sequence of nonnegative real num- 
bers. If the power series 


f(z) = Ss? dna” 
n=0 


converges for |x| <1 and if 


then 


Proof. We begin by showing that for every polynomial p(x) we have 


lim (1 — 2) S- bnx”p(a”) = | p(a)dx. (16.6) 


1- 
a n=0 


Since both sides are linear in p(x), it suffices to prove this for p(x) = 2x*. 


We have 


(1-2) S- bnz”p(2”) = (1-2) > byw” a®” 
n=0 n=0 


1 = lo) 
ies = (he att") ys bya 
—2 
n=0 


1 co 
= ay ye tae 
n=0 


l+a+---+ah 


and so 


Jim (1— 2) } 7 bna™p(a") 


n=0 
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lim ——— lim (1- 2**1 bn (a +1)” 
Pic Renerareerar ea re v pe er) 


1 [ ka 
— = x : 
are ea aac 


This proves (16.6). 

Next we use the Weierstrass approximation theorem: If f(x) is a con- 
tinuous function on the interval [0,1] and if « > 0, then there exists a 
polynomial p(x) such that 


I 


\f(a) — p(a)|<e for all x € [0,1]. 
Let ft (x) = f(x) +¢/2, and let p*(x) be a polynomial such that 
|ft (x) — pt(x)| <e/2 for all x € [0,1]. 


Then 
f(x) < pt(x) < f(z) +e for all x € [0,1] 


and : ‘ ‘ 

[ f(x)dx < | pt (x)dx < | f(x)dx +e. 
Similarly, there exists a polynomial p~ (x) such that 

f(a) —e<p (x) < f(x) for all a € [0,1] 
and 


1 fine eS [ rae < a + Cee 


Consider the function 


gle) = { 


1 1 
| g(x)dx = Gees 1. 
0 


e-1 & 


for0<a2<e}, 
for e~!<a<l. 


8 |r OC 


Then 


The function g(x) is continuous for all x € [0,1] except for x = e~!, where 
it has a jump discontinuity, and so we cannot apply Weierstrass’s theorem 
directly to approximate g(a) from above and below by polynomials. We 
circumvent this difficulty in the following way. Let 0 < ¢ < e~!. Define the 
function f+ (x) as follows: 


5 forO<a<e7!l-e, 
fir@=< O(a) fore t-e<a<ert, 
t+6 fore! <a<1, 
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where ¢* (x) is the straight line with end points (e~!—«,¢/2) and (e~!,e+ 
e/2). Then f*(a) is a continuous function on the interval [0,1], and so 
there exists a polynomial pt (x) such that 


g(@) < fe) <pt (ay < f(a) + 3 
for all a € [0,1]. Then 


E for0O<a<el-—e, 
O<pt(z)<4 ete foret—e<a<e! 
t+e foret<a<l, 


and so 


1 
1 = f oa)de 
0 
1 
aoa 


0 
et a 
= : ar x)dx +f pt (x)dx +f pt (x)dx 
0 ele e-t 
< e(e '—e)+(etele+1t+e(1—e") 
= l1l+(et+l)e. 
Similarly, we define the function f~ (a) as follows: 
s for0<a<e-! 
f-(@)=¢ € (2) foret<a<e l+e 
i_¢€ fore-tte<a<l, 


where ¢~(x) is the straight line with end points (e~!,—e/2) and (e~! + 
e,1/(e~' —e/2). Then f~ (2) is a continuous function on the interval (0, 1], 
and there exists a polynomial p7 (x) such that 


= E ms = 
f (2) —5 <P (2) < fF (2) < oz) 
for all x € [0,1]. It follows that 


i= i mene 


> [ wae 
e+e 1 
> | (-e)ae+ fo Go) dx 
—e(e~! +e) —log(e™? +e) —e(1—e7 1 —€) 
1—e-log(1 + ee) 
> 1l-(et+le. 
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The inequality p~ (x) < g(x) < p*(a) implies that for 0 <2 <1, 
(1-2) S- bnx"p (2") < (1-2) ss bra” g(a”) 
n=0 n=0 
< (1 o x) oy bap a”). 
n=0 
By (16.6), 
1 
1= +e < | p (t)dt 
0 


= lim (1-2) > bnx” p(x”) 


x21 


n=0 
< liminf(1 — byx” g(a” 
< liminf( )d, x"g(a") 
< limsup(1 — 2) S- bnt” g(a") 
xr—1- n=0 
< lim (1- b,x" pt (a” 
< lim ( *) x™p* (a) 
1 
= | pt (a)dx 
0 


< l+(et+lLe. 


These inequalities hold for all sufficiently small ¢, and so 


lim (1 — x) y bya g(a*) = 1. 
r—1— 
k=0 


Let 
r=e ln, 
Then 0 < # < 1, and 


if and only if 


It follows from the definition of the function g(a) that 


>) baa*g(e*) = $7 bee*g(a*) = So ba, 
k=0 k=0 k=0 
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and so 
n 


lim (1—e7/") Sy = 1, 


n—0o 
k=0 


” 1 
So bk ~ —1/n° 
k=0 eet 


that is, 


From the inequality 


ae 
l-z<e*<l-a#t+> 


with x = 1/n, we obtain 


and so 


as n — oo. Therefore, 


This completes the proof. 


Exercises 


1. Prove that 
—logr~l—-ax asx—-l. 


2. Let B = {bn }92, be a sequence of real, nonnegative numbers such 
that the power series f(x) = >>> 9 bnx” converges for |x| < 1. Prove 
that if ae 

eer O08 On 
l f —-— > 
Want ayn =v 
then 
lim inf(1 — x) log f(a) > a. 
w—17 


3. Let B = {b,}°2, be a sequence of real, nonnegative numbers such 
that the power series f(x) = )>>~ bnx” converges for |a| < 1. Prove 
that if 


log by 
lim sup < Ja, 


then 
lim sup(1 — x) log f(a) < a. 


x17 
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16.4 Notes 


Theorem 16.1 and Theorem 16.2 show that a set A with gced(A) = 1 has 
positive density a if and only if logp,4(n) ~ co./an. Erdés states these 
results, with a sketch of a proof, in his paper [32], where Theorem 16.3 is 
also stated and applied. The proofs in this book appear in Nathanson [105, 
106]. 

Theorem 16.4 is a famous Tauberian theorem of Hardy and Littlewood [53]; 
the proof in this book is due to Karamata [77]. Titchmarsh [142, Chapter 
7| discusses this and many related results. 

Using hard analytic machinery, Freiman [36], Kohlbecker [84], and Yang [158] 
have obtained other inverse theorems for partitions. 

We know the asymptotics of partition functions for certain sets of integers 
of zero density. For example, Hardy and Ramanujan [57] proved that if A 
is the set of kth powers of positive integers, then 


1 1 1 k/(k+1) 
log pac (n) ~ (k+ 1) {zr (; + i) ¢ (; a i)} nV (k+1) 


where I'(s) is the gamma function and ¢(s) is the Riemann zeta function. 
This gives (15.2) in the special case k = 1. In the same paper, they also 
proved that if P is the set of prime numbers, then 


n 
l 1 
ogpp(n) ~ 2m |r, 


and if P“*) is the set of kth powers of primes, then 


1 1 k/(k+1) ‘ 1/(k+1) 
log pos (n) ~ en {t (+2) ; G+) (Tan | . 
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